FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark fftw.

Project Site

fftw.org

Test Created

22 January 2015

Last Updated

16 August 2017

Test Maintainer

Michael Larabel 

Test Type

Processor

Average Install Time

1 Minute, 59 Seconds

Average Run Time

2 Minutes, 54 Seconds

Test Dependencies

C/C++ Compiler Toolchain

Accolades

100k+ Downloads

Supported Platforms


Public Result Uploads *Reported Installs **Reported Test Completions **Test Profile Page Views ***OpenBenchmarking.orgEventsFFTW Popularity Statisticspts/fftw2015.012015.052015.092016.012016.052016.092017.012017.052017.092018.012018.052018.092019.012019.052019.092020.012020.052020.092021.012021.052021.092022.012022.052022.092023.012023.052023.092024.012024.052024.0920K40K60K80K100K
* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
*** Test profile page view reporting began March 2021.
Data updated weekly as of 19 September 2024.
Float + SSE49.2%Stock50.8%Build Option PopularityOpenBenchmarking.org
2D FFT Size 1285.5%2D FFT Size 5125.6%1D FFT Size 10245.6%1D FFT Size 5125.4%2D FFT Size 2565.3%2D FFT Size 645.3%2D FFT Size 10245.8%1D FFT Size 1285.4%2D FFT Size 326.4%2D FFT Size 409611.0%1D FFT Size 2565.2%1D FFT Size 20485.5%2D FFT Size 20485.7%1D FFT Size 328.2%1D FFT Size 40968.8%1D FFT Size 645.3%Size Option PopularityOpenBenchmarking.org

Revision History

pts/fftw-1.2.0   [View Source]   Wed, 16 Aug 2017 10:29:55 GMT
Update against fftw 3.3.6, add AVX2/AVX512 enables

pts/fftw-1.1.0   [View Source]   Sat, 24 Jan 2015 12:28:44 GMT
Switch to using Mflops as a scale.

pts/fftw-1.0.0   [View Source]   Thu, 22 Jan 2015 11:35:11 GMT
Initial commit of fftw.

Suites Using This Test

C/C++ Compiler Tests

HPC - High Performance Computing

CPU Massive

Scientific Computing


Performance Metrics

Analyze Test Configuration:

FFTW 3.3.6

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.org metrics for this test profile configuration based on 1,725 public results since 16 August 2017 with the latest data as of 19 September 2024.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component
Percentile Rank
# Compatible Public Results
Mflops (Average)
100th
8
42591 +/- 1527
100th
9
40273 +/- 3112
100th
6
36856 +/- 1015
99th
3
33260 +/- 1940
97th
13
31447 +/- 478
97th
29
31355 +/- 945
95th
19
30643 +/- 2585
95th
9
30085 +/- 530
94th
3
29469 +/- 290
93rd
8
28495 +/- 3090
92nd
22
27469 +/- 1236
92nd
12
27355 +/- 1686
92nd
6
26974 +/- 1193
91st
6
26537 +/- 2254
91st
3
26366 +/- 586
89th
16
24865 +/- 1852
89th
12
24498 +/- 1979
88th
8
24188 +/- 2629
88th
6
23759 +/- 1005
85th
4
22290 +/- 3181
83rd
4
20881 +/- 1536
82nd
6
20769 +/- 539
82nd
5
20711 +/- 1653
82nd
4
20530 +/- 746
82nd
5
20526 +/- 2273
81st
29
20358 +/- 1844
81st
8
20085 +/- 637
80th
6
19907 +/- 1868
79th
6
19556 +/- 2026
78th
3
19208 +/- 1178
77th
16
19078 +/- 1669
77th
14
18895 +/- 1080
76th
5
18780 +/- 474
76th
3
18734 +/- 456
Mid-Tier
75th
< 18485
74th
30
18276 +/- 1066
72nd
4
17830 +/- 1639
71st
3
17576 +/- 1101
70th
7
17353 +/- 535
69th
7
17295 +/- 1320
68th
9
17161 +/- 1602
68th
3
17084 +/- 239
68th
4
16990 +/- 278
67th
16
16944 +/- 1200
64th
6
16332 +/- 1325
62nd
22
16162 +/- 110
61st
8
16037 +/- 1416
60th
12
15832 +/- 1326
59th
5
15759 +/- 1279
59th
11
15736 +/- 986
59th
4
15703 +/- 621
59th
18
15594 +/- 1013
58th
3
15565 +/- 1340
58th
4
15494 +/- 948
58th
6
15423 +/- 1088
58th
6
15401 +/- 1117
57th
6
15311 +/- 872
57th
6
15286 +/- 787
57th
6
15256 +/- 1243
57th
7
15211 +/- 1062
56th
6
15142 +/- 1061
56th
6
15100 +/- 1205
55th
4
15039 +/- 1188
51st
5
14747 +/- 371
Median
50th
14633
49th
3
14473 +/- 840
48th
4
14401 +/- 1317
48th
7
14342 +/- 476
48th
6
14294 +/- 205
47th
11
14218 +/- 1596
42nd
17
13107 +/- 313
41st
12
13000 +/- 1904
40th
22
12849 +/- 520
40th
3
12832 +/- 380
40th
3
12719 +/- 1487
39th
4
12710 +/- 999
38th
3
12344 +/- 492
37th
6
12245 +/- 334
36th
5
11882 +/- 98
34th
3
11169 +/- 393
33rd
5
10693 +/- 268
33rd
13
10636 +/- 148
30th
12
10038 +/- 316
28th
3
9085 +/- 116
27th
3
8666 +/- 143
Low-Tier
25th
< 7595
23rd
31
6579 +/- 338
21st
4
6188 +/- 88
20th
5
5192 +/- 387
16th
3
4810 +/- 106
13th
29
3558 +/- 23
7th
28
2551 +/- 125
3rd
3
1156 +/- 8
OpenBenchmarking.orgDistribution Of Public Results - Build: Float + SSE - Size: 2D FFT Size 40961725 Results Range From 95 To 52279 Mflops951139218332274271531563597403844794911053511579126231366714711157551679917843188871993120975220192306324107251512619527239282832932730371314153245933503345473559136635376793872339767408114185542899439434498746031470754811949163502075125152295306090120150

Based on OpenBenchmarking.org data, the selected test / test configuration (FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 4096) has an average run-time of 23 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

OpenBenchmarking.orgMinutesTime Required To Complete BenchmarkBuild: Float + SSE - Size: 2D FFT Size 4096Run-Time20406080100Min: 3 / Avg: 22.28 / Max: 89

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 1%.

OpenBenchmarking.orgPercent, Fewer Is BetterAverage Deviation Between RunsBuild: Float + SSE - Size: 2D FFT Size 4096Deviation3691215Min: 0 / Avg: 0.96 / Max: 7

Does It Scale Well With Increasing Cores?

No, based on the automated analysis of the collected public benchmark data, this test / test settings does not generally scale well with increasing CPU core counts. Data based on publicly available results for this test / test settings, separated by vendor, result divided by the reference CPU clock speed, grouped by matching physical CPU core count, and normalized against the smallest core count tested from each vendor for each CPU having a sufficient number of test samples and statistically significant data.

IntelAMDOpenBenchmarking.orgRelative Core Scaling To BaseFFTW CPU Core ScalingBuild: Float + SSE - Size: 2D FFT Size 40962468101216243248640.71881.43762.15642.87523.594

Tested CPU Architectures

This benchmark has been successfully tested on the below mentioned architectures. The CPU architectures listed is where successful OpenBenchmarking.org result uploads occurred, namely for helping to determine if a given test is compatible with various alternative CPU architectures.

CPU Architecture
Kernel Identifier
Verified On
Intel / AMD x86 64-bit
x86_64
(Many Processors)
SPARC64
sparc64
(Many Processors)
IBM Z
s390x
(Many Processors)
IBM POWER (PowerPC) 64-bit
ppc64le
POWER9 16-Core
MIPS 64-bit
mips64
ICT Loongson-3A R3
Intel / AMD x86 32-bit
i686
(Many Processors)
ARMv7 32-bit
armv7l
ARMv7 rev 4 4-Core
DEC Alpha
alpha
Alpha
ARMv8 64-bit
aarch64
ARMv8, ARMv8 Cortex-A76 4-Core, ARMv8 Neoverse-N1 160-Core

Recent Test Results

OpenBenchmarking.org Results Compare

1 System - 45 Benchmark Results

2 x AMD EPYC 7443 24-Core - Dell PowerEdge R7525 [03WYW4] - AMD Starship

Red Hat Enterprise Linux 9.4 - 5.14.0-427.33.1.el9_4.x86_64 - X Server

1 System - 45 Benchmark Results

2 x AMD EPYC 7302 16-Core - Dell PowerEdge R7525 [0H3K7P] - AMD Starship

Red Hat Enterprise Linux 9.4 - 5.14.0-427.33.1.el9_4.x86_64 - X Server

1 System - 45 Benchmark Results

2 x AMD EPYC 7302 16-Core - Dell PowerEdge R7525 [0590KW] - AMD Starship

Red Hat Enterprise Linux 9.4 - 5.14.0-427.33.1.el9_4.x86_64 - X Server

1 System - 45 Benchmark Results

2 x AMD EPYC 7443 24-Core - Dell PowerEdge R7525 [03WYW4] - AMD Starship

Red Hat Enterprise Linux 9.4 - 5.14.0-427.33.1.el9_4.x86_64 - X Server

20 Systems - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-503.el9.x86_64 - X Server

19 Systems - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-503.el9.x86_64 - X Server

1 System - 32 Benchmark Results

ARMv8 Neoverse-N1 - Oracle TLA MB TRAY A1-2c - 1008GB

Ubuntu 22.04 - 6.5.0-1026-oracle - 1.3.255

1 System - 32 Benchmark Results

ARMv8 Neoverse-N1 - Oracle TLA MB TRAY A1-2c - 1008GB

Ubuntu 22.04 - 6.5.0-1026-oracle - 1.3.255

18 Systems - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-474.el9.x86_64 - X Server

1 System - 32 Benchmark Results

ARMv8 Neoverse-N1 - Oracle TLA MB TRAY A1-2c - 1008GB

Ubuntu 22.04 - 6.5.0-1026-oracle - 1.3.255

1 System - 32 Benchmark Results

ARMv8 Neoverse-N1 - Oracle TLA MB TRAY A1-2c - 1008GB

Ubuntu 22.04 - 6.5.0-1026-oracle - 1.3.255

17 Systems - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-467.el9.x86_64 - X Server

1 System - 32 Benchmark Results

ARMv8 Neoverse-N1 - Oracle TLA MB TRAY A1-2c - 1008GB

Ubuntu 22.04 - 6.5.0-1026-oracle - 1.3.255

1 System - 32 Benchmark Results

ARMv8 Neoverse-N1 - Oracle TLA MB TRAY A1-2c - 1008GB

Ubuntu 22.04 - 6.5.0-1026-oracle - 1.3.255

1 System - 32 Benchmark Results

ARMv8 Neoverse-N1 - Oracle TLA MB TRAY A1-2c - 1008GB

Ubuntu 22.04 - 6.5.0-1026-oracle - 1.3.255

Most Popular Test Results

OpenBenchmarking.org Results Compare

6 Systems - 1421 Benchmark Results

Unknown - Marvell Armada 3720 Board - 2048MB

Ubuntu 16.04 - 4.4.52-armada-17.06.2-g12feccb - GCC 5.4.0 20160609

12 Systems - 593 Benchmark Results

AMD Ryzen 7 5800X 8-Core - Gigabyte X570 AORUS MASTER - AMD Starship

Fedora 33 - 5.8.16-300.fc33.x86_64 - GNOME Shell 3.38.1

4 Systems - 131 Benchmark Results

Intel Core i7-10700T - Insyde CometLake TBD by OEM - Intel

FreeBSD - 13.0-BETA1 - Clang 11.0.1

8 Systems - 439 Benchmark Results

Intel Core i5-10600K - Gigabyte Z490 AORUS MASTER - Intel Comet Lake PCH

Ubuntu 21.04 - 5.12.0-051200rc3daily20210315-generic - GNOME Shell 3.38.3

2 Systems - 123 Benchmark Results

Intel Core i7-8700K - ASUS TUF Z370-PLUS GAMING - Intel 8th Gen Core

ManjaroLinux 18.0.4 - 4.19.49-1-MANJARO - Xfce 4.13

3 Systems - 56 Benchmark Results

AMD Ryzen 3 2200G with Radeon Vega - Gigabyte AX370-Gaming 5 - AMD Device 15d0

Ubuntu 17.10 - 4.15.1-041501-generic - GNOME Shell 3.26.2

3 Systems - 376 Benchmark Results

2 x AMD EPYC 7F72 24-Core - Supermicro H11DSi-NT v2.00 - AMD Starship

Ubuntu 20.10 - 5.11.0-rc4-max-boost-inv-patch - GNOME Shell 3.38.1

1 System - 62 Benchmark Results

ICT Loongson-3A R3 - Unknown - AMD RS780 + SB7x0

Loongnix 1.0 - 3.10.84-16.fc21.loongson.mips64el - MATE 1.8.1

2 Systems - 403 Benchmark Results

Intel Core i9-10900K - Gigabyte Z490 AORUS MASTER - Intel Comet Lake PCH

Ubuntu 20.04 - 5.4.0-48-generic - GNOME Shell 3.36.4

6 Systems - 79 Benchmark Results

AMD Ryzen Threadripper 2990WX 32-Core - ASUS ROG ZENITH EXTREME - AMD 17h

Ubuntu 18.04 - 4.18.0-18-generic - GNOME Shell 3.28.3

Find More Test Results