fftw-1.2.0run

2 x Intel Xeon E5-2620 v2 testing with an ASUS Z9PE-D8 WS (5503 BIOS) motherboard and ASPEED graphics on CentOS Stream 9 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2409043-NE-FFTW120RU44

Run Management

Result Identifier          Date Run      Test Duration
debug                      July 13       1 Hour, 54 Minutes
NoGVNO2                    July 14       57 Minutes
NoGVNO3                    July 15       57 Minutes
OptNoSimplO2               July 16       17 Minutes
2 x Intel Xeon E5-2620 v2  July 17       17 Minutes
OptNoSimplO3               July 19       39 Minutes
OptSimplO2                 July 21       59 Minutes
OptSimplO3                 July 22       58 Minutes
OptRedO2                   July 27       57 Minutes
OptRedO3                   July 28       58 Minutes
OptPREO2                   August 06     59 Minutes
GVNO2                      August 07     58 Minutes
NewGVNO2                   August 09     58 Minutes
OptPREO3                   August 12     58 Minutes
NewGVNO333                 August 28     58 Minutes
GVNO3                      August 29     59 Minutes
PessimisticO3              September 04  58 Minutes
Average                                  55 Minutes


fftw-1.2.0run

Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads)
Motherboard: ASUS Z9PE-D8 WS (5503 BIOS)
Chipset: Intel Xeon E7 v2/Xeon
Memory: 32GB
Disk: 256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00P (some runs list the two drives in the reverse order)
Graphics: ASPEED
Audio: Realtek ALC898
Network: 2 x Intel 82574L
Monitor: ASUS VW190 (listed for later runs only)
OS: CentOS Stream 9
Kernel: 5.14.0-467.el9.x86_64 (x86_64); later runs report 5.14.0-474.el9.x86_64, 5.14.0-480.el9.x86_64, 5.14.0-496.el9.x86_64 and 5.14.0-503.el9.x86_64
Display Server: X Server
Display Driver: NVIDIA
Compiler: GCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2; later runs report GCC 11.5.0 20240719 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2
File-System: ext4
Screen Resolution: 1024x768

Kernel Details
- Transparent Huge Pages: always

Environment Details
- debug: CXXFLAGS=-O2 CFLAGS=-O2
- NoGVNO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -disable-gvn=true" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -disable-gvn=true"
- NoGVNO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -disable-gvn=true" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -disable-gvn=true"
- OptNoSimplO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false"
- 2 x Intel Xeon E5-2620 v2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false"
- OptNoSimplO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false"
- OptSimplO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true"
- OptSimplO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true"
- OptRedO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false"
- OptRedO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false"
- OptPREO2: CXXFLAGS="-O2 -mllvm -enable-newgvn" CFLAGS="-O2 -mllvm -enable-newgvn"
- GVNO2: CXXFLAGS=-O2 CFLAGS=-O2
- NewGVNO2: CXXFLAGS="-O2 -mllvm -enable-newgvn" CFLAGS="-O2 -mllvm -enable-newgvn"
- OptPREO3: CXXFLAGS="-O3 -mllvm -enable-newgvn" CFLAGS="-O3 -mllvm -enable-newgvn"
- NewGVNO333: CXXFLAGS="-O3 -mllvm -enable-newgvn" CFLAGS="-O3 -mllvm -enable-newgvn"
- GVNO3: CXXFLAGS=-O3 CFLAGS=-O3
- PessimisticO3: CXXFLAGS=-O3 CFLAGS=-O3

Compiler Details
- Optimized build with assertions; Built Apr 11 2013 (07:43:48); Default target: i386-pc-linux-gnu; Host CPU: i686

Processor Details
- Scaling Governor: intel_cpufreq conservative
- CPU Microcode: 0x42e

Security Details
- gather_data_sampling: Not affected
- itlb_multihit: KVM: Mitigation of VMX disabled
- l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable
- mds: Mitigation of Clear buffers; SMT vulnerable
- meltdown: Mitigation of PTI
- mmio_stale_data: Unknown: No mitigations
- reg_file_data_sampling: Not affected
- retbleed: Not affected
- spec_rstack_overflow: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Retpolines; IBPB: conditional; IBRS_FW; STIBP: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected
- srbds: Not affected
- tsx_async_abort: Not affected

Result Overview (Phoronix Test Suite): relative performance of the 17 runs across the 32 FFTW tests, plotted on a scale from 100% to 120%.

fftw-1.2.0run

Column headers give the transform dimension and FFT size (for example, "1D 32" is "1D FFT Size 32").

fftw: Stock
Run                          1D 32    1D 64    2D 32    2D 64   1D 128   1D 256   1D 512   2D 128   2D 256   2D 512  1D 1024  1D 2048  1D 4096  2D 1024  2D 2048  2D 4096
debug                       4998.5   4906.8   5218.7   4309.2   4420.6   4296.2   4320.4   3855.8   3660.6   3780.7   4285.6   4055.6   3914.8   3123.1   2706.6   2751.4
NoGVNO2                     4967.1   4907.7   5211.5   4336.7   4414.6   4308.3   4344.4   3835.5   3686.7   3773.3   4282.9   4051.3   3895.4   3091.6   2723.7   2756.0
NoGVNO3                     5020.8   4897.9   5219.0   4289.8   4379.5   4290.5   4345.7   3861.6   3693.3   3797.5   4270.0   4055.0   3912.1   3107.2   2736.0   2738.9
OptNoSimplO2                5008.9   4846.9   5127.9   4337.1   4358.8   4309.2   4340.1   3812.5   3671.3   3679.2   4233.7   4046.9   3909.5   3053.1   2662.0   2231.1
2 x Intel Xeon E5-2620 v2   5024.6   4899.5   5115.0   4336.6   4354.3   4316.1   4285.8   3848.0   3634.9   3736.5   4178.4   4046.4   3907.0   3048.5   2642.1   2239.3
OptNoSimplO3                4898.0   4899.9   5144.6   4354.7   4364.2   4312.8   4314.4   3839.4   3616.9   3751.1   4198.1   4031.3   3917.4   3018.7   2664.3   2200.0
OptSimplO2                  4962.7   4911.4   5217.9   4318.6   4408.4   4275.6   4334.2   3836.5   3642.8   3748.4   4276.0   4030.6   3911.5   3110.3   2726.6   2757.9
OptSimplO3                  4963.8   4905.7   5213.4   4326.3   4417.1   4247.7   4346.8   3836.4   3673.0   3759.4   4286.3   4039.5   3919.3   3083.8   2725.0   2758.9
OptRedO2                    4957.9   4892.0   5189.3   4320.8   4415.5   4288.1   4340.0   3831.9   3622.2   3780.6   4280.9   4048.3   3923.9   3123.4   2728.1   2773.9
OptRedO3                    4990.1   4899.1   5220.5   4337.3   4415.3   4329.5   4367.8   3821.5   3655.8   3784.4   4252.9   4037.2   3921.7   3149.1   2707.2   2767.1
OptPREO2                    5000.0   4902.9   5144.8   4286.4   4402.8   4314.3   4366.8   3855.9   3641.6   3775.9   4267.9   4045.5   3931.3   3126.0   2710.8   2762.4
GVNO2                       4997.4   4885.0   5179.1   4206.7   4433.2   4300.9   4298.4   3836.2   3589.5   3767.9   4232.8   3989.4   3936.0   3142.0   2716.2   2755.0
NewGVNO2                    5005.7   4894.2   5213.2   4316.4   4405.0   4308.3   4334.2   3834.9   3648.0   3769.9   4263.2   4051.5   3924.2   3138.1   2727.8   2752.9
OptPREO3                    5008.8   4893.8   5209.0   4321.4   4427.2   4241.6   4338.5   3837.3   3583.5   3785.1   4288.4   4034.4   3920.7   3085.7   2700.4   2755.4
NewGVNO333                  5021.3   4891.0   5180.4   4265.1   4294.0   4317.0   4332.9   3824.5   3610.3   3776.9   4286.7   4053.4   3934.4   3114.7   2718.0   2745.8
GVNO3                       5019.8   4899.6   5176.4   4347.4   4327.3   4313.4   4352.1   3805.3   3654.9   3761.9   4290.6   4042.0   3950.0   3120.2   2705.0   2779.8
PessimisticO3               4979.9   4914.1   5176.1   4280.9   4434.7   4289.2   4350.4   3812.3   3603.6   3780.8   4282.5   4063.8   3934.6   3133.0   2726.8   2774.8

fftw: Float + SSE
Run                          1D 32    1D 64    2D 32    2D 64   1D 128   1D 256   1D 512   2D 128   2D 256   2D 512  1D 1024  1D 2048  1D 4096  2D 1024  2D 2048  2D 4096
debug                       7287.0   9870.6    16440    14982    12702    13679    15552    11729    11414    11870    16266    15513    14091  10890.7   6459.1   6092.7
NoGVNO2                     7311.4    10350    16548    15878    13026    13989    15560    11743    11399    11803    16141    15461    14186    12316   7027.0   6717.9
NoGVNO3                     7347.4    10470    16526    15952    13206    14032    15471    11952    11511    11750    16059    15457    14353    12221   7077.7   6717.1
OptNoSimplO2                6805.0    10131    13471    13502    12789    13651    15289    12188    11094    11714    15635    15543    14243    11903   6728.2   5603.7
2 x Intel Xeon E5-2620 v2   6925.7    10229    13394    13636    12740    13831    15270    12295    11087    11715    15517    15539    14259    11906   6665.9   5618.5
OptNoSimplO3                6912.7   9964.4    13414    13740    12850    13795    15400    12591    11137    11778    15630    15551    14317    11949   6731.1   5634.2
OptSimplO2                  7116.0    10256    16444    15782    13245    14455    15414    11578    11440    11780    15962    15526    14337    12337   7024.0   6694.6
OptSimplO3                  7041.3    10321    16373    15981    13058    13983    15574    11934    11378    11858    16037    15631    14185    12182   7072.1   6707.9
OptRedO2                    7252.4    10354    16668    15989    13072    14125    15294    11609    11356    11836    16099    15568    14234    12202   7021.6   6697.3
OptRedO3                    7194.9    10397    16519    15845    13000    13997    15404    11943    11337    11709    16169    15538    14430    12298   7059.0   6718.8
OptPREO2                    7355.9  10357.8    16344    15388    13167    14247    15538    11788    11465    11821    16090    15407    14279    12125   7005.5   6688.4
GVNO2                       7205.1   9996.3    16640    16035    12831    14044    15704    11739    11411    11841    16167    15428    14363    12218   6983.6   6703.2
NewGVNO2                    7397.6    10418    16344    15906    12908    14150    15612    11796    11408    11652    16234    15564    14438    12213   6988.0   6697.2
OptPREO3                    7126.7    10315    16348    16141    13143    14266    15210    11781    11482    11893    16143    15389    13979    12132   6992.7   6707.0
NewGVNO333                  7316.4  10279.5    16554    16110    13219    14009    15545    11991    11548    11803    15939    15571    14434    11990   7080.8   6732.2
GVNO3                       7243.1    10372    16311    15842    13146    14124    15576    11795    11467    11818    16119    15426    14289    12064   7049.2   6724.5
PessimisticO3               7166.1    10410    16487    15611    13019    13794    15525    11820    11480    11821    16223    15481    14375    12131   7085.3   6716.1

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
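As a rough illustration of what the library computes and how a throughput figure in Mflops can be derived, the sketch below evaluates the DFT directly from its definition and applies the conventional 5 * N * log2(N) flop-count estimate for a complex transform of size N. The function names and the flop-count convention are assumptions made for illustration; this page does not state how the test profile derives its Mflops values, and FFTW itself uses much faster algorithms than this O(N^2) loop.

```python
import cmath
import math


def naive_dft(samples):
    """Direct O(N^2) evaluation of X[k] = sum_n x[n] * exp(-2*pi*i*k*n/N)."""
    n_points = len(samples)
    return [
        sum(
            x * cmath.exp(-2j * cmath.pi * k * n / n_points)
            for n, x in enumerate(samples)
        )
        for k in range(n_points)
    ]


def estimated_mflops(n_points, seconds):
    """Assumed convention: 5 * N * log2(N) flops for a complex FFT of size N."""
    flops = 5.0 * n_points * math.log2(n_points)
    return flops / seconds / 1e6


if __name__ == "__main__":
    # A unit impulse transforms to a flat spectrum of ones.
    print(naive_dft([1, 0, 0, 0]))
    # Hypothetical timing: a size-1024 transform taking 10 microseconds.
    print(estimated_mflops(1024, 10e-6))
```

Under this convention a higher Mflops value simply means the same nominal amount of work finished in less time, which is why the graphs are labelled "More Is Better".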

FFTW 3.3.6 - Build: Stock - Size: 1D FFT Size 32
Mflops, More Is Better
Run                         Mflops  SE +/-   N
OptNoSimplO3                4898.0   95.20  13
OptRedO2                    4957.9   31.02   3
OptSimplO2                  4962.7   25.51   3
OptSimplO3                  4963.8   39.97   3
NoGVNO2                     4967.1    8.35   3
PessimisticO3               4979.9   15.16   3
OptRedO3                    4990.1   13.97   3
GVNO2                       4997.4   13.08   3
debug                       4998.5   29.89   3
OptPREO2                    5000.0   15.07   3
NewGVNO2                    5005.7   31.02   3
OptPREO3                    5008.8   18.68   3
OptNoSimplO2                5008.9   17.04   3
GVNO3                       5019.8   26.35   3
NoGVNO3                     5020.8   23.10   3
NewGVNO333                  5021.3    2.77   3
2 x Intel Xeon E5-2620 v2   5024.6    4.74   3

FFTW 3.3.6 - Build: Stock - Size: 1D FFT Size 64
Mflops, More Is Better
Run                         Mflops  SE +/-   N
OptNoSimplO2                4846.9   61.09   3
GVNO2                       4885.0   14.95   3
NewGVNO333                  4891.0    0.13   3
OptRedO2                    4892.0    7.40   3
OptPREO3                    4893.8   10.53   3
NewGVNO2                    4894.2    2.54   3
NoGVNO3                     4897.9    5.21   3
OptRedO3                    4899.1    8.65   3
2 x Intel Xeon E5-2620 v2   4899.5    7.41   3
GVNO3                       4899.6    6.59   3
OptNoSimplO3                4899.9    4.42   3
OptPREO2                    4902.9   11.20   3
OptSimplO3                  4905.7    3.52   3
debug                       4906.8    2.78   3
NoGVNO2                     4907.7    3.97   3
OptSimplO2                  4911.4    6.01   3
PessimisticO3               4914.1    4.79   3

FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 32
Mflops, More Is Better
Run                         Mflops  SE +/-   N
2 x Intel Xeon E5-2620 v2   5115.0    6.03   3
OptNoSimplO2                5127.9   43.81   3
OptNoSimplO3                5144.6   41.81   3
OptPREO2                    5144.8   55.21   3
PessimisticO3               5176.1   36.57   3
GVNO3                       5176.4   39.92   3
GVNO2                       5179.1   33.03   3
NewGVNO333                  5180.4   11.03   3
OptRedO2                    5189.3   32.60   3
OptPREO3                    5209.0    5.69   3
NoGVNO2                     5211.5   12.52   3
NewGVNO2                    5213.2    1.29   3
OptSimplO3                  5213.4   12.54   3
OptSimplO2                  5217.9    6.97   3
debug                       5218.7    7.51   3
NoGVNO3                     5219.0    6.35   3
OptRedO3                    5220.5   11.16   3

FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 64
Mflops, More Is Better
Run                         Mflops  SE +/-   N
GVNO2                       4206.7   51.49   4
NewGVNO333                  4265.1   60.33   3
PessimisticO3               4280.9   60.71   3
OptPREO2                    4286.4   61.56   3
NoGVNO3                     4289.8   46.52   4
debug                       4309.2   21.54   3
NewGVNO2                    4316.4   22.90   3
OptSimplO2                  4318.6    4.27   3
OptRedO2                    4320.8   22.79   3
OptPREO3                    4321.4    3.34   3
OptSimplO3                  4326.3   21.71   3
2 x Intel Xeon E5-2620 v2   4336.6    5.04   3
NoGVNO2                     4336.7    4.40   3
OptNoSimplO2                4337.1    4.58   3
OptRedO3                    4337.3   18.00   3
GVNO3                       4347.4    2.62   3
OptNoSimplO3                4354.7    7.74   3

FFTW 3.3.6 - Build: Stock - Size: 1D FFT Size 128
Mflops, More Is Better
Run                         Mflops  SE +/-   N
NewGVNO333                  4294.0   51.58  15
GVNO3                       4327.3   52.07  15
2 x Intel Xeon E5-2620 v2   4354.3    3.10   3
OptNoSimplO2                4358.8    1.43   3
OptNoSimplO3                4364.2    2.94   3
NoGVNO3                     4379.5   35.10   9
OptPREO2                    4402.8    4.02   3
NewGVNO2                    4405.0    5.65   3
OptSimplO2                  4408.4    2.45   3
NoGVNO2                     4414.6    2.46   3
OptRedO3                    4415.3    2.25   3
OptRedO2                    4415.5    4.33   3
OptSimplO3                  4417.1    2.14   3
debug                       4420.6    6.43   3
OptPREO3                    4427.2    9.28   3
GVNO2                       4433.2    1.52   3
PessimisticO3               4434.7    1.58   3

FFTW 3.3.6 - Build: Stock - Size: 1D FFT Size 256
Mflops, More Is Better
Run                         Mflops  SE +/-   N
OptPREO3                    4241.6   43.31   3
OptSimplO3                  4247.7   53.04   3
OptSimplO2                  4275.6    8.48   3
OptRedO2                    4288.1   22.07   3
PessimisticO3               4289.2   10.82   3
NoGVNO3                     4290.5   15.53   3
debug                       4296.2    1.72   3
GVNO2                       4300.9   24.27   3
NoGVNO2                     4308.3    1.57   3
NewGVNO2                    4308.3    5.17   3
OptNoSimplO2                4309.2    1.60   3
OptNoSimplO3                4312.8   24.49   3
GVNO3                       4313.4   19.87   3
OptPREO2                    4314.3    3.18   3
2 x Intel Xeon E5-2620 v2   4316.1   27.37   3
NewGVNO333                  4317.0   17.44   3
OptRedO3                    4329.5   17.30   3

FFTW 3.3.6 - Build: Stock - Size: 1D FFT Size 512
Mflops, More Is Better
Run                         Mflops  SE +/-   N
2 x Intel Xeon E5-2620 v2   4285.8   22.16   3
GVNO2                       4298.4   23.58   3
OptNoSimplO3                4314.4   11.47   3
debug                       4320.4   16.10   3
NewGVNO333                  4332.9   10.16   3
OptSimplO2                  4334.2   27.65   3
NewGVNO2                    4334.2    7.37   3
OptPREO3                    4338.5   13.39   3
OptRedO2                    4340.0    3.08   3
OptNoSimplO2                4340.1    2.84   3
NoGVNO2                     4344.4   12.50   3
NoGVNO3                     4345.7   15.04   3
OptSimplO3                  4346.8    4.56   3
PessimisticO3               4350.4   16.20   3
GVNO3                       4352.1   15.02   3
OptPREO2                    4366.8    8.76   3
OptRedO3                    4367.8   10.85   3

FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 128
Mflops, More Is Better
Run                         Mflops  SE +/-   N
GVNO3                       3805.3    4.10   3
PessimisticO3               3812.3   29.21   3
OptNoSimplO2                3812.5   31.53   3
OptRedO3                    3821.5   28.74   3
NewGVNO333                  3824.5   30.58   3
OptRedO2                    3831.9   14.94   3
NewGVNO2                    3834.9   17.74   3
NoGVNO2                     3835.5    7.88   3
GVNO2                       3836.2   21.71   3
OptSimplO3                  3836.4    4.22   3
OptSimplO2                  3836.5   18.17   3
OptPREO3                    3837.3   29.76   3
OptNoSimplO3                3839.4   21.66   3
2 x Intel Xeon E5-2620 v2   3848.0    9.25   3
debug                       3855.8    4.06   3
OptPREO2                    3855.9   11.59   3
NoGVNO3                     3861.6    3.71   3

FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 256
Mflops, More Is Better
Run                         Mflops  SE +/-   N
OptPREO3                    3583.5   35.49   3
GVNO2                       3589.5   47.63   3
PessimisticO3               3603.6   50.06   3
NewGVNO333                  3610.3   23.72   3
OptNoSimplO3                3616.9   37.88   3
OptRedO2                    3622.2   16.18   3
2 x Intel Xeon E5-2620 v2   3634.9   23.89   3
OptPREO2                    3641.6   12.43   3
OptSimplO2                  3642.8   34.93   3
NewGVNO2                    3648.0   33.28   3
GVNO3                       3654.9   12.33   3
OptRedO3                    3655.8   33.79   3
debug                       3660.6   23.34   3
OptNoSimplO2                3671.3   17.09   3
OptSimplO3                  3673.0   34.82   3
NoGVNO2                     3686.7   13.35   3
NoGVNO3                     3693.3    7.11   3

FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 512
Mflops, More Is Better
Run                         Mflops  SE +/-   N
OptNoSimplO2                3679.2   38.03   5
2 x Intel Xeon E5-2620 v2   3736.5   41.40   5
OptSimplO2                  3748.4   34.42  12
OptNoSimplO3                3751.1   14.36   3
OptSimplO3                  3759.4    9.69   3
GVNO3                       3761.9    6.57   3
GVNO2                       3767.9    5.58   3
NewGVNO2                    3769.9   20.08   3
NoGVNO2                     3773.3   13.13   3
OptPREO2                    3775.9   16.55   3
NewGVNO333                  3776.9   14.05   3
OptRedO2                    3780.6    6.77   3
debug                       3780.7   10.97   3
PessimisticO3               3780.8   21.49   3
OptRedO3                    3784.4    7.10   3
OptPREO3                    3785.1    3.13   3
NoGVNO3                     3797.5   10.93   3

FFTW 3.3.6 - Build: Stock - Size: 1D FFT Size 1024
Mflops, More Is Better
Run                         Mflops  SE +/-   N
2 x Intel Xeon E5-2620 v2   4178.4   59.35   3
OptNoSimplO3                4198.1   25.83   3
GVNO2                       4232.8   48.65   4
OptNoSimplO2                4233.7    8.01   3
OptRedO3                    4252.9   32.58   3
NewGVNO2                    4263.2    9.92   3
OptPREO2                    4267.9   14.59   3
NoGVNO3                     4270.0   19.30   3
OptSimplO2                  4276.0   11.99   3
OptRedO2                    4280.9    4.77   3
PessimisticO3               4282.5   10.52   3
NoGVNO2                     4282.9    2.87   3
debug                       4285.6    5.54   3
OptSimplO3                  4286.3    4.50   3
NewGVNO333                  4286.7    7.01   3
OptPREO3                    4288.4    4.55   3
GVNO3                       4290.6    8.12   3

FFTW 3.3.6 - Build: Stock - Size: 1D FFT Size 2048
Mflops, More Is Better
Run                         Mflops  SE +/-   N
GVNO2                       3989.4   47.88   4
OptSimplO2                  4030.6   14.04   3
OptNoSimplO3                4031.3    6.58   3
OptPREO3                    4034.4   16.83   3
OptRedO3                    4037.2   22.16   3
OptSimplO3                  4039.5    7.07   3
GVNO3                       4042.0   14.25   3
OptPREO2                    4045.5    4.44   3
2 x Intel Xeon E5-2620 v2   4046.4   11.79   3
OptNoSimplO2                4046.9   18.65   3
OptRedO2                    4048.3   14.17   3
NoGVNO2                     4051.3    6.84   3
NewGVNO2                    4051.5   12.48   3
NewGVNO333                  4053.4    8.15   3
NoGVNO3                     4055.0    8.12   3
debug                       4055.6    6.96   3
PessimisticO3               4063.8    9.60   3

FFTW 3.3.6 - Build: Stock - Size: 1D FFT Size 4096
Mflops, More Is Better
Run                         Mflops  SE +/-   N
NoGVNO2                     3895.4    8.63   3
2 x Intel Xeon E5-2620 v2   3907.0    8.30   3
OptNoSimplO2                3909.5    1.10   3
OptSimplO2                  3911.5    7.30   3
NoGVNO3                     3912.1   13.90   3
debug                       3914.8   17.36   3
OptNoSimplO3                3917.4   13.95   3
OptSimplO3                  3919.3   15.54   3
OptPREO3                    3920.7    4.73   3
OptRedO3                    3921.7    5.34   3
OptRedO2                    3923.9    4.75   3
NewGVNO2                    3924.2    4.58   3
OptPREO2                    3931.3   11.64   3
NewGVNO333                  3934.4    0.15   3
PessimisticO3               3934.6    5.80   3
GVNO2                       3936.0    9.96   3
GVNO3                       3950.0    8.49   3

FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 1024
Mflops, More Is Better
Run                         Mflops  SE +/-   N
OptNoSimplO3                3018.7    8.28   3
2 x Intel Xeon E5-2620 v2   3048.5   13.57   3
OptNoSimplO2                3053.1   12.20   3
OptSimplO3                  3083.8   10.08   3
OptPREO3                    3085.7   15.98   3
NoGVNO2                     3091.6   13.41   3
NoGVNO3                     3107.2   38.36   3
OptSimplO2                  3110.3   26.70   3
NewGVNO333                  3114.7   24.45   3
GVNO3                       3120.2   12.60   3
debug                       3123.1   15.08   3
OptRedO2                    3123.4   14.89   3
OptPREO2                    3126.0    8.38   3
PessimisticO3               3133.0    1.11   3
NewGVNO2                    3138.1   18.27   3
GVNO2                       3142.0    5.57   3
OptRedO3                    3149.1    9.39   3

FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 2048
Mflops, More Is Better
Run                         Mflops  SE +/-   N
2 x Intel Xeon E5-2620 v2   2642.1    5.49   3
OptNoSimplO2                2662.0    6.15   3
OptNoSimplO3                2664.3    6.56   3
OptPREO3                    2700.4   11.61   3
GVNO3                       2705.0   18.48   3
debug                       2706.6    3.24   3
OptRedO3                    2707.2   12.30   3
OptPREO2                    2710.8   13.94   3
GVNO2                       2716.2   13.80   3
NewGVNO333                  2718.0    6.49   3
NoGVNO2                     2723.7    8.21   3
OptSimplO3                  2725.0   16.84   3
OptSimplO2                  2726.6   17.93   3
PessimisticO3               2726.8    5.76   3
NewGVNO2                    2727.8   19.22   3
OptRedO2                    2728.1   16.99   3
NoGVNO3                     2736.0   12.56   3

FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 4096
Mflops, More Is Better
Run                         Mflops  SE +/-   N
OptNoSimplO3                2200.0   21.99  15
OptNoSimplO2                2231.1   16.21   3
2 x Intel Xeon E5-2620 v2   2239.3    8.09   3
NoGVNO3                     2738.9   16.56   3
NewGVNO333                  2745.8   13.80   3
debug                       2751.4    5.74   3
NewGVNO2                    2752.9    8.68   3
GVNO2                       2755.0    5.66   3
OptPREO3                    2755.4    7.45   3
NoGVNO2                     2756.0   14.23   3
OptSimplO2                  2757.9    8.89   3
OptSimplO3                  2758.9    5.14   3
OptPREO2                    2762.4   11.48   3
OptRedO3                    2767.1   13.13   3
OptRedO2                    2773.9   15.05   3
PessimisticO3               2774.8   23.10   3
GVNO3                       2779.8   19.40   3

FFTW 3.3.6 - Build: Float + SSE - Size: 1D FFT Size 32
Mflops, More Is Better
Run                         Mflops  SE +/-   N
OptNoSimplO2                6805.0   24.17   3
OptNoSimplO3                6912.7   71.46  15
2 x Intel Xeon E5-2620 v2   6925.7   64.34  15
OptSimplO3                  7041.3   90.54  15
OptSimplO2                  7116.0   52.52  15
OptPREO3                    7126.7   54.90  15
PessimisticO3               7166.1   56.02  15
OptRedO3                    7194.9   57.27  15
GVNO2                       7205.1   65.67   7
GVNO3                       7243.1   82.98   4
OptRedO2                    7252.4   76.45   5
debug                       7287.0    5.79   3
NoGVNO2                     7311.4   79.17   3
NewGVNO333                  7316.4   72.31   3
NoGVNO3                     7347.4   35.65   3
OptPREO2                    7355.9   98.33   3
NewGVNO2                    7397.6   12.13   3

FFTW 3.3.6 - Build: Float + SSE - Size: 1D FFT Size 64
Mflops, More Is Better
Run                         Mflops  SE +/-   N
debug                       9870.6  172.96  15
OptNoSimplO3                9964.4   98.25   3
GVNO2                       9996.3  144.31  15
OptNoSimplO2               10131.0   33.87   3
2 x Intel Xeon E5-2620 v2  10229.0   54.50   3
OptSimplO2                 10256.0  141.95   3
NewGVNO333                 10279.5  101.28   5
OptPREO3                   10315.0    7.77   3
OptSimplO3                 10321.0   15.38   3
NoGVNO2                    10350.0   99.10   3
OptRedO2                   10354.0   18.19   3
OptPREO2                   10357.8   69.65  13
GVNO3                      10372.0   40.83   3
OptRedO3                   10397.0   18.11   3
PessimisticO3              10410.0   10.59   3
NewGVNO2                   10418.0   85.65   3
NoGVNO3                    10470.0  114.62   3

FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 32
Mflops, More Is Better
Run                         Mflops  SE +/-   N
2 x Intel Xeon E5-2620 v2    13394  107.35   3
OptNoSimplO3                 13414  102.53   3
OptNoSimplO2                 13471  131.37   3
GVNO3                        16311  194.05   3
OptPREO2                     16344  142.90   3
NewGVNO2                     16344  197.17   3
OptPREO3                     16348   33.40   3
OptSimplO3                   16373   92.64   3
debug                        16440   98.77   3
OptSimplO2                   16444   64.02   3
PessimisticO3                16487   72.72   3
OptRedO3                     16519   23.39   3
NoGVNO3                      16526  139.56   3
NoGVNO2                      16548  136.41   3
NewGVNO333                   16554   81.03   3
GVNO2                        16640   87.96   3
OptRedO2                     16668   42.15   3

FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 64
Mflops, More Is Better
Run                         Mflops  SE +/-   N
OptNoSimplO2                 13502   41.97   3
2 x Intel Xeon E5-2620 v2    13636   70.91   3
OptNoSimplO3                 13740   55.77   3
debug                        14982   26.91   3
OptPREO2                     15388  159.78  15
PessimisticO3                15611  137.71  15
OptSimplO2                   15782  136.22  15
GVNO3                        15842  125.58  15
OptRedO3                     15845   86.00   3
NoGVNO2                      15878   70.08   3
NewGVNO2                     15906  181.67   3
NoGVNO3                      15952  133.36   3
OptSimplO3                   15981   82.53   3
OptRedO2                     15989   23.46   3
GVNO2                        16035   50.86   3
NewGVNO333                   16110   39.26   3
OptPREO3                     16141   39.26   3

FFTW 3.3.6 - Build: Float + SSE - Size: 1D FFT Size 128
Mflops, More Is Better
Run                         Mflops  SE +/-   N
debug                        12702  136.03  15
2 x Intel Xeon E5-2620 v2    12740  136.77   3
OptNoSimplO2                 12789   84.00   3
GVNO2                        12831   79.22   3
OptNoSimplO3                 12850   80.49   3
NewGVNO2                     12908  169.95   3
OptRedO3                     13000   67.87   3
PessimisticO3                13019   90.08   3
NoGVNO2                      13026  160.95   3
OptSimplO3                   13058  130.44   3
OptRedO2                     13072  133.00   3
OptPREO3                     13143  120.91   3
GVNO3                        13146   52.35   3
OptPREO2                     13167  132.00   5
NoGVNO3                      13206   33.80   3
NewGVNO333                   13219   46.51   3
OptSimplO2                   13245   45.94   3

FFTW 3.3.6 - Build: Float + SSE - Size: 1D FFT Size 256
Mflops, More Is Better
Run                         Mflops  SE +/-   N
OptNoSimplO2                 13651   68.09   3
debug                        13679   98.34   3
PessimisticO3                13794  101.06   3
OptNoSimplO3                 13795   58.29   3
2 x Intel Xeon E5-2620 v2    13831   67.57   3
OptSimplO3                   13983   36.46   3
NoGVNO2                      13989   30.75   3
OptRedO3                     13997  165.24   3
NewGVNO333                   14009  146.24   4
NoGVNO3                      14032   28.45   3
GVNO2                        14044  150.68   4
GVNO3                        14124  169.36   3
OptRedO2                     14125  145.93   4
NewGVNO2                     14150   71.88   3
OptPREO2                     14247  145.87   3
OptPREO3                     14266  127.87   3
OptSimplO2                   14455  115.02   3

FFTW 3.3.6 - Build: Float + SSE - Size: 1D FFT Size 512
Mflops, More Is Better
Run                         Mflops  SE +/-   N
OptPREO3                     15210   45.83   3
2 x Intel Xeon E5-2620 v2    15270  127.67   3
OptNoSimplO2                 15289  157.18   3
OptRedO2                     15294  195.78   3
OptNoSimplO3                 15400   66.70   3
OptRedO3                     15404  104.45   3
OptSimplO2                   15414   60.73   3
NoGVNO3                      15471  120.34   3
PessimisticO3                15525  168.49   3
OptPREO2                     15538  127.81   3
NewGVNO333                   15545  163.42   3
debug                        15552  123.58   3
NoGVNO2                      15560   81.91   3
OptSimplO3                   15574   63.39   3
GVNO3                        15576  109.69   3
NewGVNO2                     15612   71.36   3
GVNO2                        15704   89.29   3

FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 128
Mflops, More Is Better
Run                         Mflops  SE +/-   N
OptSimplO2                   11578  111.70   3
OptRedO2                     11609   48.56   3
debug                        11729   84.15  12
GVNO2                        11739   95.45   3
NoGVNO2                      11743   95.52   3
OptPREO3                     11781  147.41   3
OptPREO2                     11788   94.72  15
GVNO3                        11795  105.79   7
NewGVNO2                     11796   91.43  10
PessimisticO3                11820  101.06   3
OptSimplO3                   11934   47.40   3
OptRedO3                     11943  104.94   8
NoGVNO3                      11952   92.80   3
NewGVNO333                   11991   86.04   3
OptNoSimplO2                 12188  113.85  15
2 x Intel Xeon E5-2620 v2    12295   82.96  15
OptNoSimplO3                 12591   50.90   3

FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 256
Mflops, More Is Better
Run                         Mflops  SE +/-   N
2 x Intel Xeon E5-2620 v2    11087  119.21   3
OptNoSimplO2                 11094  115.75   3
OptNoSimplO3                 11137   21.39   3
OptRedO3                     11337  140.84   3
OptRedO2                     11356  108.39   3
OptSimplO3                   11378   91.42   3
NoGVNO2                      11399   85.30   3
NewGVNO2                     11408   84.51   3
GVNO2                        11411   40.76   3
debug                        11414   86.71   3
OptSimplO2                   11440   35.14   3
OptPREO2                     11465   55.86   3
GVNO3                        11467   34.04   3
PessimisticO3                11480   62.20   3
OptPREO3                     11482   60.95   3
NoGVNO3                      11511   40.95   3
NewGVNO333                   11548   65.80   3

FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 512
Mflops, More Is Better
Run                         Mflops  SE +/-   N
NewGVNO2                     11652   64.51   3
OptRedO3                     11709   39.68   3
OptNoSimplO2                 11714   64.54   3
2 x Intel Xeon E5-2620 v2    11715   37.37   3
NoGVNO3                      11750   19.97   3
OptNoSimplO3                 11778   61.50   3
OptSimplO2                   11780  128.25   3
NoGVNO2                      11803   84.01   3
NewGVNO333                   11803   41.82   3
GVNO3                        11818   52.21   3
OptPREO2                     11821   83.95   3
PessimisticO3                11821   60.34   3
OptRedO2                     11836   93.72   3
GVNO2                        11841   40.29   3
OptSimplO3                   11858   36.67   3
debug                        11870   97.11   3
OptPREO3                     11893   57.89   3

FFTW 3.3.6 - Build: Float + SSE - Size: 1D FFT Size 1024
Mflops, More Is Better
Run                         Mflops  SE +/-   N
2 x Intel Xeon E5-2620 v2    15517  112.78   3
OptNoSimplO3                 15630   32.22   3
OptNoSimplO2                 15635   43.66   3
NewGVNO333                   15939  129.47   3
OptSimplO2                   15962   73.63   3
OptSimplO3                   16037  141.62   3
NoGVNO3                      16059   80.68   3
OptPREO2                     16090   82.71   3
OptRedO2                     16099   85.56   3
GVNO3                        16119   15.17   3
NoGVNO2                      16141   67.72   3
OptPREO3                     16143   21.01   3
GVNO2                        16167   78.73   3
OptRedO3                     16169   94.45   3
PessimisticO3                16223   78.81   3
NewGVNO2                     16234   75.24   3
debug                        16266   80.85   3

FFTW 3.3.6 - Build: Float + SSE - Size: 1D FFT Size 2048
Mflops, More Is Better
Run                         Mflops  SE +/-   N
OptPREO3                     15389   16.00   3
OptPREO2                     15407   53.18   3
GVNO3                        15426   76.47   3
GVNO2                        15428   20.00   3
NoGVNO3                      15457   78.24   3
NoGVNO2                      15461   75.96   3
PessimisticO3                15481   97.36   3
debug                        15513   41.31   3
OptSimplO2                   15526   37.49   3
OptRedO3                     15538   33.86   3
2 x Intel Xeon E5-2620 v2    15539  107.49   3
OptNoSimplO2                 15543   24.95   3
OptNoSimplO3                 15551   15.63   3
NewGVNO2                     15564   42.13   3
OptRedO2                     15568   58.95   3
NewGVNO333                   15571   20.01   3
OptSimplO3                   15631   27.28   3

FFTW 3.3.6 - Build: Float + SSE - Size: 1D FFT Size 4096
Mflops, More Is Better
Run                         Mflops  SE +/-   N
OptPREO3                     13979   89.82   3
debug                        14091  193.73   3
OptSimplO3                   14185   94.10   3
NoGVNO2                      14186   80.21   3
OptRedO2                     14234   62.13   3
OptNoSimplO2                 14243   19.46   3
2 x Intel Xeon E5-2620 v2    14259   82.03   3
OptPREO2                     14279   55.00   3
GVNO3                        14289   41.07   3
OptNoSimplO3                 14317   55.79   3
OptSimplO2                   14337  123.09   3
NoGVNO3                      14353  135.86   3
GVNO2                        14363   46.84   3
PessimisticO3                14375   33.27   3
OptRedO3                     14430   94.63   3
NewGVNO333                   14434   25.71   3
NewGVNO2                     14438   56.04   3

FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 1024
Mflops, More Is Better
Run                         Mflops  SE +/-   N
debug                      10890.7  644.68  12
OptNoSimplO2               11903.0  124.86   3
2 x Intel Xeon E5-2620 v2  11906.0   40.01   3
OptNoSimplO3               11949.0   58.86   3
NewGVNO333                 11990.0   72.25   3
GVNO3                      12064.0   43.51   3
OptPREO2                   12125.0   58.30   3
PessimisticO3              12131.0   46.46   3
OptPREO3                   12132.0   74.77   3
OptSimplO3                 12182.0   70.39   3
OptRedO2                   12202.0   76.29   3
NewGVNO2                   12213.0   63.84   3
GVNO2                      12218.0   69.17   3
NoGVNO3                    12221.0   76.28   3
OptRedO3                   12298.0   15.68   3
NoGVNO2                    12316.0   81.04   3
OptSimplO2                 12337.0   48.60   3

FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 2048
Mflops, More Is Better
Run                         Mflops  SE +/-   N
debug                       6459.1   44.49   3
2 x Intel Xeon E5-2620 v2   6665.9   41.03   3
OptNoSimplO2                6728.2   33.67   3
OptNoSimplO3                6731.1   25.37   3
GVNO2                       6983.6   18.93   3
NewGVNO2                    6988.0   12.08   3
OptPREO3                    6992.7   43.40   3
OptPREO2                    7005.5   26.38   3
OptRedO2                    7021.6   19.13   3
OptSimplO2                  7024.0   15.79   3
NoGVNO2                     7027.0   11.97   3
GVNO3                       7049.2    2.26   3
OptRedO3                    7059.0   28.87   3
OptSimplO3                  7072.1   14.30   3
NoGVNO3                     7077.7    8.17   3
NewGVNO333                  7080.8    2.80   3
PessimisticO3               7085.3   10.65   3

FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 4096
Mflops, More Is Better
Run                         Mflops  SE +/-   N
OptNoSimplO2                5603.7    4.28   3
2 x Intel Xeon E5-2620 v2   5618.5    8.82   3
OptNoSimplO3                5634.2    7.27   3
debug                       6092.7  262.37   9
OptPREO2                    6688.4   15.10   3
OptSimplO2                  6694.6    9.08   3
NewGVNO2                    6697.2   10.55   3
OptRedO2                    6697.3    4.50   3
GVNO2                       6703.2    5.39   3
OptPREO3                    6707.0   20.27   3
OptSimplO3                  6707.9   11.45   3
PessimisticO3               6716.1    8.99   3
NoGVNO3                     6717.1   21.46   3
NoGVNO2                     6717.9   13.11   3
OptRedO3                    6718.8   15.62   3
GVNO3                       6724.5   23.90   3
NewGVNO333                  6732.2   23.42   3
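
The "SE +/-" and "N" figures attached to each result are a standard error and a trial count. The sketch below shows one way such a figure can be computed, assuming the usual definition (sample standard deviation divided by the square root of N). The function names and the trial values in the comments are hypothetical, and this page does not state exactly how the Phoronix Test Suite derives the number. The second helper expresses the gap between two runs as a percentage, which helps judge whether a difference is large compared with the reported error.

```python
import math
import statistics


def mean_and_standard_error(trials):
    """Return (mean, standard error) for a list of per-trial results."""
    n = len(trials)
    if n < 2:
        raise ValueError("need at least two trials to estimate a standard error")
    mean = statistics.fmean(trials)
    # Assumed definition: sample standard deviation over sqrt(N).
    standard_error = statistics.stdev(trials) / math.sqrt(n)
    return mean, standard_error


def relative_difference_percent(baseline, candidate):
    """Percentage by which candidate differs from baseline."""
    return (candidate - baseline) / baseline * 100.0


if __name__ == "__main__":
    # Hypothetical Mflops readings from three trials of one test.
    print(mean_and_standard_error([4990.0, 5001.0, 5004.5]))
    # Stock 2D FFT Size 4096: lowest (2200.0) versus highest (2779.8) result.
    print(relative_difference_percent(2200.0, 2779.8))
```

For the Stock 2D FFT Size 4096 test, the gap between the slowest and fastest runs (2200.0 versus 2779.8 Mflops) is roughly 26%, far larger than any standard error reported for that test, whereas most of the other tests show spreads of only a few percent.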