FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark fftw.

Project Site

fftw.org

Test Created

22 January 2015

Last Updated

16 August 2017

Test Maintainer

Michael Larabel 

Test Type

Processor

Average Install Time

1 Minute, 50 Seconds

Average Run Time

2 Minutes, 54 Seconds

Test Dependencies

C/C++ Compiler Toolchain

Accolades

100k+ Downloads

Supported Platforms


Public Result Uploads *Reported Installs **Reported Test Completions **Test Profile Page Views ***OpenBenchmarking.orgEventsFFTW Popularity Statisticspts/fftw2015.012015.052015.092016.012016.052016.092017.012017.052017.092018.012018.052018.092019.012019.052019.092020.012020.052020.092021.012021.052021.092022.012022.052022.092023.012023.052023.092024.012024.0516K32K48K64K80K
* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
*** Test profile page view reporting began March 2021.
Data updated weekly as of 23 July 2024.
Float + SSE49.2%Stock50.8%Build Option PopularityOpenBenchmarking.org
2D FFT Size 1285.4%2D FFT Size 5125.6%1D FFT Size 10245.6%1D FFT Size 5125.3%2D FFT Size 2565.2%2D FFT Size 645.2%2D FFT Size 10245.8%1D FFT Size 1285.3%2D FFT Size 326.4%2D FFT Size 409611.5%1D FFT Size 2565.1%1D FFT Size 20485.4%2D FFT Size 20485.7%1D FFT Size 328.4%1D FFT Size 40969.1%1D FFT Size 645.2%Size Option PopularityOpenBenchmarking.org

Revision History

pts/fftw-1.2.0   [View Source]   Wed, 16 Aug 2017 10:29:55 GMT
Update against fftw 3.3.6, add AVX2/AVX512 enables

pts/fftw-1.1.0   [View Source]   Sat, 24 Jan 2015 12:28:44 GMT
Switch to using Mflops as a scale.

pts/fftw-1.0.0   [View Source]   Thu, 22 Jan 2015 11:35:11 GMT
Initial commit of fftw.

Suites Using This Test

C/C++ Compiler Tests

HPC - High Performance Computing

CPU Massive

Scientific Computing


Performance Metrics

Analyze Test Configuration:

FFTW 3.3.6

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.org metrics for this test profile configuration based on 1,631 public results since 16 August 2017 with the latest data as of 26 July 2024.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component
Percentile Rank
# Compatible Public Results
Mflops (Average)
100th
8
42591 +/- 1527
100th
9
40273 +/- 3112
100th
6
36856 +/- 1015
99th
3
33260 +/- 1940
97th
13
31447 +/- 478
97th
29
31355 +/- 945
95th
19
30643 +/- 2585
94th
9
30085 +/- 530
94th
3
29469 +/- 290
93rd
8
28495 +/- 3090
92nd
22
27469 +/- 1236
92nd
12
27355 +/- 1686
91st
6
26974 +/- 1193
91st
6
26537 +/- 2254
91st
3
26366 +/- 586
89th
16
24865 +/- 1852
88th
12
24498 +/- 1979
88th
8
24188 +/- 2629
87th
6
23759 +/- 1005
85th
4
22290 +/- 3181
82nd
4
20881 +/- 1536
81st
6
20769 +/- 539
81st
5
20711 +/- 1653
81st
4
20530 +/- 746
81st
5
20526 +/- 2273
80th
29
20358 +/- 1844
80th
8
20085 +/- 637
79th
6
19907 +/- 1868
78th
6
19556 +/- 2026
77th
3
19208 +/- 1178
76th
16
19078 +/- 1669
Mid-Tier
75th
< 18926
75th
14
18895 +/- 1080
75th
5
18780 +/- 474
75th
3
18734 +/- 456
73rd
30
18276 +/- 1066
70th
4
17830 +/- 1639
69th
3
17576 +/- 1101
68th
7
17353 +/- 535
67th
7
17295 +/- 1320
67th
9
17161 +/- 1602
66th
3
17084 +/- 239
66th
4
16990 +/- 278
65th
16
16944 +/- 1200
62nd
6
16332 +/- 1325
60th
22
16162 +/- 110
59th
8
16037 +/- 1416
58th
12
15832 +/- 1326
57th
5
15759 +/- 1279
57th
11
15736 +/- 986
57th
4
15703 +/- 621
56th
18
15594 +/- 1013
56th
3
15565 +/- 1340
56th
4
15494 +/- 948
56th
6
15423 +/- 1088
56th
6
15401 +/- 1117
55th
6
15311 +/- 872
55th
6
15286 +/- 787
55th
6
15256 +/- 1243
54th
7
15211 +/- 1062
54th
6
15142 +/- 1061
53rd
6
15100 +/- 1205
53rd
4
15039 +/- 1188
Median
50th
14940
49th
5
14747 +/- 371
46th
3
14473 +/- 840
46th
4
14401 +/- 1317
45th
7
14342 +/- 476
45th
6
14294 +/- 205
44th
11
14218 +/- 1596
39th
17
13107 +/- 313
38th
12
13000 +/- 1904
37th
22
12849 +/- 520
37th
3
12832 +/- 380
36th
3
12719 +/- 1487
36th
4
12710 +/- 999
34th
3
12344 +/- 492
34th
6
12245 +/- 334
32nd
5
11882 +/- 98
31st
3
11169 +/- 393
30th
5
10693 +/- 268
29th
13
10636 +/- 148
27th
12
10038 +/- 316
Low-Tier
25th
< 9471
25th
3
9085 +/- 116
24th
3
8666 +/- 143
20th
19
6496 +/- 414
20th
21
6431 +/- 47
19th
4
6188 +/- 88
18th
5
5192 +/- 387
14th
3
4810 +/- 106
11th
29
3558 +/- 23
6th
28
2551 +/- 125
3rd
3
1156 +/- 8
OpenBenchmarking.orgDistribution Of Public Results - Build: Float + SSE - Size: 2D FFT Size 40961616 Results Range From 95 To 52279 Mflops951139218332274271531563597403844794911053511579126231366714711157551679917843188871993120975220192306324107251512619527239282832932730371314153245933503345473559136635376793872339767408114185542899439434498746031470754811949163502075125152295306090120150

Based on OpenBenchmarking.org data, the selected test / test configuration (FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 4096) has an average run-time of 24 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

OpenBenchmarking.orgMinutesTime Required To Complete BenchmarkBuild: Float + SSE - Size: 2D FFT Size 4096Run-Time20406080100Min: 3 / Avg: 23.08 / Max: 92

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 1%.

OpenBenchmarking.orgPercent, Fewer Is BetterAverage Deviation Between RunsBuild: Float + SSE - Size: 2D FFT Size 4096Deviation3691215Min: 0 / Avg: 1 / Max: 7

Does It Scale Well With Increasing Cores?

No, based on the automated analysis of the collected public benchmark data, this test / test settings does not generally scale well with increasing CPU core counts. Data based on publicly available results for this test / test settings, separated by vendor, result divided by the reference CPU clock speed, grouped by matching physical CPU core count, and normalized against the smallest core count tested from each vendor for each CPU having a sufficient number of test samples and statistically significant data.

IntelAMDOpenBenchmarking.orgRelative Core Scaling To BaseFFTW CPU Core ScalingBuild: Float + SSE - Size: 2D FFT Size 40962468101216243248640.71881.43762.15642.87523.594

Tested CPU Architectures

This benchmark has been successfully tested on the below mentioned architectures. The CPU architectures listed is where successful OpenBenchmarking.org result uploads occurred, namely for helping to determine if a given test is compatible with various alternative CPU architectures.

CPU Architecture
Kernel Identifier
Verified On
Intel / AMD x86 64-bit
x86_64
(Many Processors)
SPARC64
sparc64
(Many Processors)
IBM Z
s390x
(Many Processors)
IBM POWER (PowerPC) 64-bit
ppc64le
POWER9 16-Core
MIPS 64-bit
mips64
ICT Loongson-3A R3
Intel / AMD x86 32-bit
i686
(Many Processors)
ARMv7 32-bit
armv7l
ARMv7 rev 4 4-Core
DEC Alpha
alpha
Alpha
ARMv8 64-bit
aarch64
ARMv8, ARMv8 Cortex-A76 4-Core, ARMv8 Neoverse-N1 160-Core

Recent Test Results

OpenBenchmarking.org Results Compare

6 Systems - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-467.el9.x86_64 - X Server

1 System - 32 Benchmark Results

ARMv8 Neoverse-N1 - Oracle TLA MB TRAY A1-2c - 1008GB

Ubuntu 22.04 - 6.5.0-1025-oracle - 1.3.255

6 Systems - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-467.el9.x86_64 - X Server

1 System - 32 Benchmark Results

ARMv8 Neoverse-N1 - Oracle TLA MB TRAY A1-2c - 1008GB

Ubuntu 22.04 - 6.5.0-1025-oracle - 1.3.255

1 System - 32 Benchmark Results

2 x AMD EPYC 9J14 96-Core - Oracle Asm MB+Tray E5-2c - AMD Device 14a4

Ubuntu 22.04 - 6.5.0-1025-oracle - 1.3.255

6 Systems - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-467.el9.x86_64 - X Server

5 Systems - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-467.el9.x86_64 - X Server

1 System - 32 Benchmark Results

2 x AMD EPYC 9J14 96-Core - Oracle Asm MB+Tray E5-2c - AMD Device 14a4

Ubuntu 22.04 - 6.5.0-1025-oracle - 1.3.255

1 System - 57 Benchmark Results

AMD Ryzen 9 7950X3D 16-Core - ASUS TUF GAMING X670E-PLUS WIFI - AMD Device 14d8

Ubuntu 22.04 - 6.5.0-44-generic - GNOME Shell 42.9

1 System - 32 Benchmark Results

ARMv8 Neoverse-N1 - Oracle TLA MB TRAY A1-2c - 1008GB

Ubuntu 22.04 - 6.5.0-1025-oracle - 1.3.255

5 Systems - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-467.el9.x86_64 - X Server

1 System - 32 Benchmark Results

2 x AMD EPYC 9J14 96-Core - Oracle Asm MB+Tray E5-2c - AMD Device 14a4

Ubuntu 22.04 - 6.5.0-1025-oracle - 1.3.255

5 Systems - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-467.el9.x86_64 - X Server

1 System - 32 Benchmark Results

ARMv8 Neoverse-N1 - Oracle TLA MB TRAY A1-2c - 1008GB

Ubuntu 22.04 - 6.5.0-1025-oracle - 1.3.255

1 System - 32 Benchmark Results

2 x AMD EPYC 9J14 96-Core - Oracle Asm MB+Tray E5-2c - AMD Device 14a4

Ubuntu 22.04 - 6.5.0-1025-oracle - 1.3.255

Most Popular Test Results

OpenBenchmarking.org Results Compare

6 Systems - 1421 Benchmark Results

Unknown - Marvell Armada 3720 Board - 2048MB

Ubuntu 16.04 - 4.4.52-armada-17.06.2-g12feccb - GCC 5.4.0 20160609

12 Systems - 593 Benchmark Results

AMD Ryzen 5 3600X 6-Core - MSI X470 GAMING M7 AC - AMD Starship

Ubuntu 20.04 - 5.8.0-050800daily20200622-generic - GNOME Shell 3.36.2

2 Systems - 123 Benchmark Results

Intel Core i7-8700K - ASUS TUF Z370-PLUS GAMING - Intel 8th Gen Core

Clear Linux OS 29920 - 5.1.9-781.native - GNOME Shell 3.32.2

4 Systems - 131 Benchmark Results

Intel Core i7-10700T - Insyde CometLake TBD by OEM - Intel

FreeBSD - 13.0-BETA1 - Clang 11.0.1

8 Systems - 439 Benchmark Results

Intel Core i5-10600K - Gigabyte Z490 AORUS MASTER - Intel Comet Lake PCH

Ubuntu 21.04 - 5.12.0-051200rc3daily20210315-generic - GNOME Shell 3.38.3

3 Systems - 56 Benchmark Results

AMD Ryzen 5 2400G with Radeon Vega - Gigabyte AX370-Gaming 5 - AMD Device 15d0

Ubuntu 17.10 - 4.15.1-041501-generic - GNOME Shell 3.26.2

3 Systems - 376 Benchmark Results

2 x AMD EPYC 7F72 24-Core - Supermicro H11DSi-NT v2.00 - AMD Starship

Ubuntu 20.10 - 5.10.9-051009-generic - GNOME Shell 3.38.1

1 System - 62 Benchmark Results

ICT Loongson-3A R3 - Unknown - AMD RS780 + SB7x0

Loongnix 1.0 - 3.10.84-16.fc21.loongson.mips64el - MATE 1.8.1

2 Systems - 475 Benchmark Results

AMD Ryzen Threadripper 3970X 32-Core - ASUS ROG ZENITH II EXTREME - AMD Starship

Ubuntu 19.10 - 5.3.0-18-generic - GNOME Shell 3.34.1

3 Systems - 301 Benchmark Results

Intel Core i5-10600K - ASUS PRIME Z490M-PLUS - Intel Comet Lake PCH

Ubuntu 20.04 - 5.4.0-40-generic - GNOME Shell 3.36.3

Find More Test Results