FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions.
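
As a rough illustration of what a one-dimensional DFT computes, here is a minimal Python sketch that evaluates the definition directly. This is not FFTW's algorithm or API: FFTW uses fast O(n log n) methods, while the hypothetical helper `naive_dft` below is the O(n^2) textbook sum, using the same unnormalized, negative-exponent convention as FFTW's forward transform.

```python
import cmath


def naive_dft(x):
    """Return the unnormalized forward DFT of x via the O(n^2) definition.

    Illustrative helper only (not part of FFTW):
    X[k] = sum_j x[j] * exp(-2*pi*i*j*k/n)
    """
    n = len(x)
    return [
        sum(x[j] * cmath.exp(-2j * cmath.pi * j * k / n) for j in range(n))
        for k in range(n)
    ]


# A unit impulse transforms to a flat spectrum of ones.
impulse_spectrum = naive_dft([1.0, 0.0, 0.0, 0.0])

# A constant signal puts all of its energy in bin 0.
constant_spectrum = naive_dft([1.0, 1.0, 1.0, 1.0])
```

The benchmark sizes listed on this page (for example 1D FFT Size 4096) correspond to the length `n` of such a transform.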

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark fftw.

Project Site

fftw.org

Test Created

22 January 2015

Last Updated

16 August 2017

Test Maintainer

Michael Larabel 

Test Type

Processor

Average Install Time

1 Minute, 59 Seconds

Average Run Time

2 Minutes, 54 Seconds

Test Dependencies

C/C++ Compiler Toolchain

Accolades

100k+ Downloads

Supported Platforms


Chart: FFTW Popularity Statistics (pts/fftw), events over time from 2015.01 to 2024.09. Series: Public Result Uploads *, Reported Installs **, Reported Test Completions **, Test Profile Page Views ***
* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
*** Test profile page view reporting began March 2021.
Data updated weekly as of 19 September 2024.
Build Option Popularity
Stock: 50.8%
Float + SSE: 49.2%

Size Option Popularity
1D FFT Size 32: 8.2%
1D FFT Size 64: 5.3%
1D FFT Size 128: 5.4%
1D FFT Size 256: 5.2%
1D FFT Size 512: 5.4%
1D FFT Size 1024: 5.6%
1D FFT Size 2048: 5.5%
1D FFT Size 4096: 8.8%
2D FFT Size 32: 6.4%
2D FFT Size 64: 5.3%
2D FFT Size 128: 5.5%
2D FFT Size 256: 5.3%
2D FFT Size 512: 5.6%
2D FFT Size 1024: 5.8%
2D FFT Size 2048: 5.7%
2D FFT Size 4096: 11.0%

Revision History

pts/fftw-1.2.0   Wed, 16 Aug 2017 10:29:55 GMT
Update against fftw 3.3.6, add AVX2/AVX512 enables

pts/fftw-1.1.0   Sat, 24 Jan 2015 12:28:44 GMT
Switch to using Mflops as a scale.

pts/fftw-1.0.0   Thu, 22 Jan 2015 11:35:11 GMT
Initial commit of fftw.

Suites Using This Test

C/C++ Compiler Tests

HPC - High Performance Computing

CPU Massive

Scientific Computing


Performance Metrics

Analyze Test Configuration:

FFTW 3.3.6

Build: Stock - Size: 1D FFT Size 4096

OpenBenchmarking.org metrics for this test profile configuration based on 1,367 public results since 16 August 2017 with the latest data as of 19 September 2024.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. Keep in mind that OS configurations can differ vastly, particularly in the Linux/open-source space, so this overview is intended to offer only general guidance on performance expectations.

Component   Percentile Rank   # Compatible Public Results   Mflops (Average)
            100th             22                            24220 +/- 484
            100th             8                             18313 +/- 199
            99th              3                             14123 +/- 1548
            98th              10                            14001 +/- 1968
            97th              3                             13577 +/- 334
            95th              34                            12747 +/- 1328
            95th              9                             12652 +/- 1271
            94th              15                            12181 +/- 757
            92nd              12                            11705 +/- 158
            92nd              13                            11683 +/- 1000
            90th              5                             11267 +/- 151
            89th              4                             11012 +/- 136
            87th              4                             10902 +/- 10
            86th              14                            10644 +/- 1191
            85th              10                            10073 +/- 807
            85th              6                             9973 +/- 415
            83rd              8                             9436 +/- 142
            81st              20                            9141 +/- 833
            81st              5                             9113 +/- 242
            80th              8                             9071 +/- 333
            78th              7                             8870 +/- 270
            78th              3                             8865 +/- 164
            77th              8                             8799 +/- 36
            76th              5                             8749 +/- 73
Mid-Tier    75th                                            < 8739
            75th              3                             8676 +/- 100
            74th              3                             8611 +/- 53
            74th              7                             8611 +/- 468
            74th              3                             8544 +/- 27
            71st              5                             8124 +/- 1024
            70th              3                             8070 +/- 965
            57th              10                            7455 +/- 847
            57th              3                             7402 +/- 46
            55th              88                            7227 +/- 1077
            54th              3                             7094 +/- 683
            53rd              10                            6977 +/- 216
            53rd              5                             6974 +/- 46
            53rd              3                             6926 +/- 38
            52nd              3                             6828 +/- 910
Median      50th                                            6788
            50th              3                             6779 +/- 13
            49th              5                             6709 +/- 223
            47th              7                             6667 +/- 31
            47th              6                             6654 +/- 13
            42nd              4                             6299 +/- 604
            36th              3                             5953 +/- 14
            35th              7                             5803 +/- 166
            33rd              4                             5577 +/- 145
            31st              12                            5377 +/- 156
            28th              4                             5089 +/- 38
            26th              3                             4759 +/- 190
Low-Tier    25th                                            < 4746
            23rd              3                             4350 +/- 253
            21st              4                             4010 +/- 95
            21st              3                             3957 +/- 53
            20th              31                            3921 +/- 16
            7th               28                            1133 +/- 25
            3rd               3                             674 +/- 3
Chart: Distribution Of Public Results - Build: Stock - Size: 1D FFT Size 4096 (1367 Results Range From 206 To 25050 Mflops)
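
To help read the Mflops figures above, the following Python sketch shows how such a number relates to transform size and time. It assumes the result follows the conventional "mflops" definition from FFTW's own benchmark methodology, 5 N log2(N) divided by the time for one transform in microseconds (2.5 N log2(N) for real-input transforms). That is a scaled speed measure, not an exact operation count, and the function name `fftw_mflops` is hypothetical.

```python
import math


def fftw_mflops(n, seconds_per_transform, real_input=False):
    """Scaled speed in "mflops" for one transform of n points.

    Assumes the conventional FFTW benchmark definition:
    5 * n * log2(n) / (time in microseconds), halved for real input.
    """
    nominal_flops = (2.5 if real_input else 5.0) * n * math.log2(n)
    microseconds = seconds_per_transform * 1e6
    return nominal_flops / microseconds


# A complex 1D transform of 4096 points taking 10 microseconds.
example_mflops = fftw_mflops(4096, 10e-6)
```

Under this assumption, a 4096-point complex transform has a nominal cost of 5 * 4096 * 12 = 245760 operations, so a 10 microsecond transform corresponds to 24576 Mflops.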

Based on OpenBenchmarking.org data, the selected test / test configuration (FFTW 3.3.6 - Build: Stock - Size: 1D FFT Size 4096) has an average run-time of 2 minutes. By default this test profile runs at least 3 times; the run count may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.
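
The run-count behavior described above can be sketched roughly as follows. This is not the Phoronix Test Suite implementation: the function name `run_until_stable`, the 3.5% threshold, and the cap of 15 runs are all assumptions chosen for illustration, and the real tool may apply additional criteria.

```python
import statistics


def run_until_stable(run_once, min_runs=3, max_runs=15, max_rel_stddev_pct=3.5):
    """Collect results until the relative standard deviation is acceptable.

    Hypothetical sketch: always performs min_runs runs, then keeps adding
    runs while the sample standard deviation, as a percentage of the mean,
    exceeds max_rel_stddev_pct, stopping at max_runs.
    """
    results = [run_once() for _ in range(min_runs)]
    while len(results) < max_runs:
        mean = statistics.mean(results)
        rel_stddev_pct = 100.0 * statistics.stdev(results) / mean
        if rel_stddev_pct <= max_rel_stddev_pct:
            break
        results.append(run_once())
    return results


# Consistent results: the minimum of 3 runs is enough.
stable = iter([6780.0, 6779.0, 6781.0])
stable_runs = run_until_stable(lambda: next(stable))

# Noisy results: extra runs are added until the cap is reached.
noisy = iter([100.0, 200.0, 100.0, 150.0, 150.0])
noisy_runs = run_until_stable(lambda: next(noisy), max_runs=5)
```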

Chart: Time Required To Complete Benchmark, in Minutes - Build: Stock - Size: 1D FFT Size 4096 (Run-Time Min: 1 / Avg: 1 / Max: 1)

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 0.3%.

Chart: Average Deviation Between Runs, in Percent, Fewer Is Better - Build: Stock - Size: 1D FFT Size 4096 (Deviation Min: 0 / Avg: 0.34 / Max: 4)

Does It Scale Well With Increasing Cores?

No. Based on the automated analysis of the collected public benchmark data, this test configuration does not generally scale well with increasing CPU core counts. The analysis uses publicly available results for this test configuration, separated by vendor, with each result divided by the reference CPU clock speed, grouped by matching physical CPU core count, and normalized against the smallest core count tested from each vendor. Only CPUs with a sufficient number of test samples and statistically significant data are included.

Chart: FFTW CPU Core Scaling - Build: Stock - Size: 1D FFT Size 4096 (Relative Core Scaling To Base by CPU core count, 4 to 64 cores; series: Intel, AMD)
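
The normalization steps described for the core-scaling analysis can be sketched as below. This is only an interpretation of that description, not OpenBenchmarking.org's actual code: the function name `relative_core_scaling`, the input tuple layout, and all sample values are hypothetical, and the sufficiency and significance filtering is omitted.

```python
from collections import defaultdict
from statistics import mean


def relative_core_scaling(samples):
    """Compute per-vendor scaling relative to the smallest core count.

    samples: iterable of (vendor, physical_cores, clock_ghz, mflops).
    Hypothetical sketch of the described steps:
      1. divide each result by the reference CPU clock speed,
      2. group by vendor and matching physical core count,
      3. normalize against the smallest core count seen for that vendor.
    """
    grouped = defaultdict(list)
    for vendor, cores, clock_ghz, mflops in samples:
        grouped[(vendor, cores)].append(mflops / clock_ghz)

    per_vendor = defaultdict(dict)
    for (vendor, cores), values in grouped.items():
        per_vendor[vendor][cores] = mean(values)

    scaling = {}
    for vendor, by_cores in per_vendor.items():
        base = by_cores[min(by_cores)]
        scaling[vendor] = {c: by_cores[c] / base for c in sorted(by_cores)}
    return scaling


# Made-up sample data, for illustration only.
example_scaling = relative_core_scaling([
    ("Intel", 4, 4.0, 8000.0),
    ("Intel", 8, 4.0, 8400.0),
    ("AMD", 4, 3.5, 7000.0),
    ("AMD", 16, 3.5, 7700.0),
])
```

A value close to 1.0 at higher core counts, as in this made-up data, is what "does not scale well with increasing cores" would look like after normalization.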

Tested CPU Architectures

This benchmark has been successfully tested on the architectures listed below. These are the CPU architectures for which successful OpenBenchmarking.org result uploads occurred, which helps determine whether a given test is compatible with various alternative CPU architectures.

CPU Architecture             Kernel Identifier   Verified On
Intel / AMD x86 64-bit       x86_64              (Many Processors)
SPARC64                      sparc64             (Many Processors)
IBM Z                        s390x               (Many Processors)
IBM POWER (PowerPC) 64-bit   ppc64le             POWER9 16-Core, POWER9 altivec supported 44-Core
Intel / AMD x86 32-bit       i686                (Many Processors)
ARMv7 32-bit                 armv7l              ARMv7 rev 3 4-Core, ARMv7 rev 4 4-Core
DEC Alpha                    alpha               Alpha
ARMv8 64-bit                 aarch64             ARMv8, ARMv8 Cortex-A53 4-Core, ARMv8 Cortex-A72 6-Core, ARMv8 Cortex-A76 4-Core, ARMv8 Neoverse-N1 160-Core, ARMv8 rev 0 8-Core

Recent Test Results

1 System - 45 Benchmark Results

2 x AMD EPYC 7443 24-Core - Dell PowerEdge R7525 [03WYW4] - AMD Starship

Red Hat Enterprise Linux 9.4 - 5.14.0-427.33.1.el9_4.x86_64 - X Server

1 System - 45 Benchmark Results

2 x AMD EPYC 7302 16-Core - Dell PowerEdge R7525 [0H3K7P] - AMD Starship

Red Hat Enterprise Linux 9.4 - 5.14.0-427.33.1.el9_4.x86_64 - X Server

1 System - 45 Benchmark Results

2 x AMD EPYC 7302 16-Core - Dell PowerEdge R7525 [0590KW] - AMD Starship

Red Hat Enterprise Linux 9.4 - 5.14.0-427.33.1.el9_4.x86_64 - X Server

1 System - 45 Benchmark Results

2 x AMD EPYC 7443 24-Core - Dell PowerEdge R7525 [03WYW4] - AMD Starship

Red Hat Enterprise Linux 9.4 - 5.14.0-427.33.1.el9_4.x86_64 - X Server

20 Systems - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-474.el9.x86_64 - X Server

19 Systems - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-503.el9.x86_64 - X Server

1 System - 32 Benchmark Results

ARMv8 Neoverse-N1 - Oracle TLA MB TRAY A1-2c - 1008GB

Ubuntu 22.04 - 6.5.0-1026-oracle - 1.3.255

1 System - 32 Benchmark Results

ARMv8 Neoverse-N1 - Oracle TLA MB TRAY A1-2c - 1008GB

Ubuntu 22.04 - 6.5.0-1026-oracle - 1.3.255

18 Systems - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-474.el9.x86_64 - X Server

1 System - 32 Benchmark Results

ARMv8 Neoverse-N1 - Oracle TLA MB TRAY A1-2c - 1008GB

Ubuntu 22.04 - 6.5.0-1026-oracle - 1.3.255

1 System - 32 Benchmark Results

ARMv8 Neoverse-N1 - Oracle TLA MB TRAY A1-2c - 1008GB

Ubuntu 22.04 - 6.5.0-1026-oracle - 1.3.255

17 Systems - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-467.el9.x86_64 - X Server

1 System - 32 Benchmark Results

ARMv8 Neoverse-N1 - Oracle TLA MB TRAY A1-2c - 1008GB

Ubuntu 22.04 - 6.5.0-1026-oracle - 1.3.255

1 System - 32 Benchmark Results

ARMv8 Neoverse-N1 - Oracle TLA MB TRAY A1-2c - 1008GB

Ubuntu 22.04 - 6.5.0-1026-oracle - 1.3.255

1 System - 32 Benchmark Results

ARMv8 Neoverse-N1 - Oracle TLA MB TRAY A1-2c - 1008GB

Ubuntu 22.04 - 6.5.0-1026-oracle - 1.3.255

Most Popular Test Results

6 Systems - 1421 Benchmark Results

Unknown - Marvell Armada 3720 Board - 2048MB

Ubuntu 16.04 - 4.4.52-armada-17.06.2-g12feccb - GCC 5.4.0 20160609

2 Systems - 1708 Benchmark Results

AMD Ryzen 3 3300X 4-Core - ASRock X570 Pro4 - AMD Starship

Ubuntu 20.04 - 5.7.0-rc6-amd-energy - GNOME Shell 3.36.2

3 Systems - 301 Benchmark Results

Intel Core i5-7600K - Gigabyte Z270M-D3H-CF - Intel Xeon E3-1200 v6

Ubuntu 20.04 - 5.4.0-40-generic - GNOME Shell 3.36.3

1 System - 748 Benchmark Results

Intel Core i7-7700K - MSI Z270 GAMING M7 - Intel Intel Kaby Lake + Z270

Ubuntu 18.04 - 4.15.0-23-generic - GNOME Shell 3.28.1

9 Systems - 244 Benchmark Results

AMD EPYC 7601 32-Core - TYAN B8026T70AE24HR - AMD Family 17h

Ubuntu 18.10 - 4.19.1-041901-generic - GNOME Shell 3.30.1

3 Systems - 777 Benchmark Results

AMD Ryzen 3 3300X 4-Core - ASRock X570 Pro4 - AMD Starship

Ubuntu 20.04 - 5.7.0-rc6-amd-energy - GNOME Shell 3.36.2

3 Systems - 44 Benchmark Results

AMD Ryzen 9 3900X 12-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Device 1480

Ubuntu 18.04 - 5.2.0-999-generic - GNOME Shell 3.28.3

6 Systems - 254 Benchmark Results

AMD Ryzen 3 PRO 4350G - ASRock B450M Pro4 - AMD Renoir Root Complex

Clear Linux OS 34550 - 5.10.32-1035.native - GNOME Shell 40.0

4 Systems - 290 Benchmark Results

2 x AMD EPYC 7742 64-Core - Supermicro H11DSi-NT v2.00 - AMD Starship

Ubuntu 20.04 - 5.8.0-44-generic - X Server 1.20.8
