fftw-1.2.0run

2 x Intel Xeon E5-2620 v2 testing with a ASUS Z9PE-D8 WS (5503 BIOS) and ASPEED on CentOS Stream 9 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2409043-NE-FFTW120RU44
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
debug
July 13
  1 Hour, 54 Minutes
NoGVNO2
July 14
  57 Minutes
NoGVNO3
July 15
  57 Minutes
OptNoSimplO2
July 16
  17 Minutes
2 x Intel Xeon E5-2620 v2
July 17
  17 Minutes
OptNoSimplO3
July 19
  39 Minutes
OptSimplO2
July 21
  59 Minutes
OptSimplO3
July 22
  58 Minutes
OptRedO2
July 27
  57 Minutes
OptRedO3
July 28
  58 Minutes
OptPREO2
August 06
  59 Minutes
GVNO2
August 07
  58 Minutes
NewGVNO2
August 09
  58 Minutes
OptPREO3
August 12
  58 Minutes
NewGVNO333
August 28
  58 Minutes
GVNO3
August 29
  59 Minutes
PessimisticO3
September 04
  58 Minutes
Invert Hiding All Results Option
  55 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


fftw-1.2.0runProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkMonitorOSKernelDisplay ServerDisplay DriverCompilerFile-SystemScreen ResolutiondebugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v2OptNoSimplO3OptSimplO2OptSimplO3OptRedO2OptRedO3OptPREO2GVNO2NewGVNO2OptPREO3NewGVNO333GVNO3PessimisticO32 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads)ASUS Z9PE-D8 WS (5503 BIOS)Intel Xeon E7 v2/Xeon32GB256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00PASPEEDRealtek ALC8982 x Intel 82574LCentOS Stream 95.14.0-467.el9.x86_64 (x86_64)X ServerNVIDIAGCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2ext41024x7682000GB Western Digital WD20EARX-00P + 256GB Samsung SSD 850ASUS VW1905.14.0-474.el9.x86_64 (x86_64)5.14.0-480.el9.x86_64 (x86_64)256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00P5.14.0-496.el9.x86_64 (x86_64)GCC 11.5.0 20240719 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.25.14.0-503.el9.x86_64 (x86_64)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: alwaysEnvironment Details- debug: CXXFLAGS=-O2 CFLAGS=-O2- NoGVNO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -disable-gvn=true" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -disable-gvn=true"- NoGVNO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -disable-gvn=true" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -disable-gvn=true"- OptNoSimplO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false"- 2 x Intel Xeon E5-2620 v2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false"- OptNoSimplO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false"- OptSimplO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true"- OptSimplO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true"- OptRedO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false"- OptRedO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false"- OptPREO2: CXXFLAGS="-O2 -mllvm -enable-newgvn" CFLAGS="-O2 -mllvm -enable-newgvn"- GVNO2: CXXFLAGS=-O2 CFLAGS=-O2- NewGVNO2: CXXFLAGS="-O2 -mllvm -enable-newgvn" CFLAGS="-O2 -mllvm -enable-newgvn"- OptPREO3: CXXFLAGS="-O3 -mllvm -enable-newgvn" CFLAGS="-O3 -mllvm -enable-newgvn"- NewGVNO333: CXXFLAGS="-O3 -mllvm -enable-newgvn" CFLAGS="-O3 -mllvm -enable-newgvn"- GVNO3: CXXFLAGS=-O3 CFLAGS=-O3- PessimisticO3: CXXFLAGS=-O3 CFLAGS=-O3Compiler Details- Optimized build with assertions; Built Apr 11 2013 (07:43:48); Default target: i386-pc-linux-gnu; Host CPU: i686Processor Details- Scaling Governor: intel_cpufreq conservative - CPU Microcode: 0x42eSecurity Details- gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Unknown: No mitigations + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; IBRS_FW; STIBP: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v2OptNoSimplO3OptSimplO2OptSimplO3OptRedO2OptRedO3OptPREO2GVNO2NewGVNO2OptPREO3NewGVNO333GVNO3PessimisticO3Result OverviewPhoronix Test Suite100%107%113%120%FFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWStock - 2D FFT Size 4096Float + SSE - 2D FFT Size 32Float + SSE - 2D FFT Size 4096Float + SSE - 2D FFT Size 64Float + SSE - 2D FFT Size 1024Float + SSE - 2D FFT Size 2048Float + SSE - 2D FFT Size 128Float + SSE - 1D FFT Size 32Float + SSE - 1D FFT Size 64Float + SSE - 1D FFT Size 256Float + SSE - 1D FFT Size 1024Stock - 2D FFT Size 1024Float + SSE - 1D FFT Size 128Float + SSE - 2D FFT Size 256Stock - 2D FFT Size 2048Stock - 2D FFT Size 64Float + SSE - 1D FFT Size 4096Stock - 1D FFT Size 128Float + SSE - 1D FFT Size 512Stock - 2D FFT Size 512Stock - 2D FFT Size 256Stock - 1D FFT Size 1024Stock - 1D FFT Size 32Stock - 1D FFT Size 256Float + SSE - 2D FFT Size 512Stock - 2D FFT Size 32Stock - 1D FFT Size 512Stock - 1D FFT Size 2048Float + SSE - 1D FFT Size 2048Stock - 2D FFT Size 128Stock - 1D FFT Size 4096Stock - 1D FFT Size 64

fftw-1.2.0runfftw: Stock - 2D FFT Size 4096fftw: Float + SSE - 2D FFT Size 32fftw: Float + SSE - 2D FFT Size 4096fftw: Float + SSE - 2D FFT Size 64fftw: Float + SSE - 2D FFT Size 1024fftw: Float + SSE - 2D FFT Size 2048fftw: Float + SSE - 2D FFT Size 128fftw: Float + SSE - 1D FFT Size 32fftw: Float + SSE - 1D FFT Size 64fftw: Float + SSE - 1D FFT Size 256fftw: Float + SSE - 1D FFT Size 1024fftw: Stock - 2D FFT Size 1024fftw: Float + SSE - 1D FFT Size 128fftw: Float + SSE - 2D FFT Size 256fftw: Stock - 2D FFT Size 2048fftw: Stock - 2D FFT Size 64fftw: Float + SSE - 1D FFT Size 4096fftw: Stock - 1D FFT Size 128fftw: Float + SSE - 1D FFT Size 512fftw: Stock - 2D FFT Size 512fftw: Stock - 2D FFT Size 256fftw: Stock - 1D FFT Size 1024fftw: Stock - 1D FFT Size 32fftw: Stock - 1D FFT Size 256fftw: Float + SSE - 2D FFT Size 512fftw: Stock - 2D FFT Size 32fftw: Stock - 1D FFT Size 512fftw: Stock - 1D FFT Size 2048fftw: Float + SSE - 1D FFT Size 2048fftw: Stock - 2D FFT Size 128fftw: Stock - 1D FFT Size 4096fftw: Stock - 1D FFT Size 64debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v2OptNoSimplO3OptSimplO2OptSimplO3OptRedO2OptRedO3OptPREO2GVNO2NewGVNO2OptPREO3NewGVNO333GVNO3PessimisticO32751.4164406092.71498210890.76459.1117297287.09870.613679162663123.112702114142706.64309.2140914420.6155523780.73660.64285.64998.54296.2118705218.74320.44055.6155133855.83914.84906.82756.0165486717.915878123167027.0117437311.41035013989161413091.613026113992723.74336.7141864414.6155603773.33686.74282.94967.14308.3118035211.54344.44051.3154613835.53895.44907.72738.9165266717.115952122217077.7119527347.41047014032160593107.213206115112736.04289.8143534379.5154713797.53693.34270.05020.84290.5117505219.04345.74055.0154573861.63912.14897.92231.1134715603.713502119036728.2121886805.01013113651156353053.112789110942662.04337.1142434358.8152893679.23671.34233.75008.94309.2117145127.94340.14046.9155433812.53909.54846.92239.3133945618.513636119066665.9122956925.71022913831155173048.512740110872642.14336.6142594354.3152703736.53634.94178.45024.64316.1117155115.04285.84046.4155393848.03907.04899.52200.0134145634.213740119496731.1125916912.79964.413795156303018.712850111372664.34354.7143174364.2154003751.13616.94198.14898.04312.8117785144.64314.44031.3155513839.43917.44899.92757.9164446694.615782123377024.0115787116.01025614455159623110.313245114402726.64318.6143374408.4154143748.43642.84276.04962.74275.6117805217.94334.24030.6155263836.53911.54911.42758.9163736707.915981121827072.1119347041.31032113983160373083.813058113782725.04326.3141854417.1155743759.43673.04286.34963.84247.7118585213.44346.84039.5156313836.43919.34905.72773.9166686697.315989122027021.6116097252.41035414125160993123.413072113562728.14320.8142344415.5152943780.63622.24280.94957.94288.1118365189.34340.04048.3155683831.93923.94892.02767.1165196718.815845122987059.0119437194.91039713997161693149.113000113372707.24337.3144304415.3154043784.43655.84252.94990.14329.5117095220.54367.84037.2155383821.53921.74899.12762.4163446688.415388121257005.5117887355.910357.814247160903126.013167114652710.84286.4142794402.8155383775.93641.64267.95000.04314.3118215144.84366.84045.5154073855.93931.34902.92755.0166406703.216035122186983.6117397205.19996.314044161673142.012831114112716.24206.7143634433.2157043767.93589.54232.84997.44300.9118415179.14298.43989.4154283836.23936.04885.02752.9163446697.215906122136988.0117967397.61041814150162343138.112908114082727.84316.4144384405.0156123769.93648.04263.25005.74308.3116525213.24334.24051.5155643834.93924.24894.22755.4163486707.016141121326992.7117817126.71031514266161433085.713143114822700.44321.4139794427.2152103785.13583.54288.45008.84241.6118935209.04338.54034.4153893837.33920.74893.82745.8165546732.216110119907080.8119917316.410279.514009159393114.713219115482718.04265.1144344294.0155453776.93610.34286.75021.34317.0118035180.44332.94053.4155713824.53934.44891.02779.8163116724.515842120647049.2117957243.11037214124161193120.213146114672705.04347.4142894327.3155763761.93654.94290.65019.84313.4118185176.44352.14042.0154263805.33950.04899.62774.8164876716.115611121317085.3118207166.11041013794162233133.013019114802726.84280.9143754434.7155253780.83603.64282.54979.94289.2118215176.14350.44063.8154813812.33934.64914.1OpenBenchmarking.org

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096GVNO3PessimisticO3OptRedO2OptRedO3OptPREO2OptSimplO3OptSimplO2NoGVNO2OptPREO3GVNO2NewGVNO2debugNewGVNO333NoGVNO32 x Intel Xeon E5-2620 v2OptNoSimplO2OptNoSimplO36001200180024003000SE +/- 19.40, N = 3SE +/- 23.10, N = 3SE +/- 15.05, N = 3SE +/- 13.13, N = 3SE +/- 11.48, N = 3SE +/- 5.14, N = 3SE +/- 8.89, N = 3SE +/- 14.23, N = 3SE +/- 7.45, N = 3SE +/- 5.66, N = 3SE +/- 8.68, N = 3SE +/- 5.74, N = 3SE +/- 13.80, N = 3SE +/- 16.56, N = 3SE +/- 8.09, N = 3SE +/- 16.21, N = 3SE +/- 21.99, N = 152779.82774.82773.92767.12762.42758.92757.92756.02755.42755.02752.92751.42745.82738.92239.32231.12200.0

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 32OptRedO2GVNO2NewGVNO333NoGVNO2NoGVNO3OptRedO3PessimisticO3OptSimplO2debugOptSimplO3OptPREO3NewGVNO2OptPREO2GVNO3OptNoSimplO2OptNoSimplO32 x Intel Xeon E5-2620 v24K8K12K16K20KSE +/- 42.15, N = 3SE +/- 87.96, N = 3SE +/- 81.03, N = 3SE +/- 136.41, N = 3SE +/- 139.56, N = 3SE +/- 23.39, N = 3SE +/- 72.72, N = 3SE +/- 64.02, N = 3SE +/- 98.77, N = 3SE +/- 92.64, N = 3SE +/- 33.40, N = 3SE +/- 197.17, N = 3SE +/- 142.90, N = 3SE +/- 194.05, N = 3SE +/- 131.37, N = 3SE +/- 102.53, N = 3SE +/- 107.35, N = 31666816640165541654816526165191648716444164401637316348163441634416311134711341413394

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096NewGVNO333GVNO3OptRedO3NoGVNO2NoGVNO3PessimisticO3OptSimplO3OptPREO3GVNO2OptRedO2NewGVNO2OptSimplO2OptPREO2debugOptNoSimplO32 x Intel Xeon E5-2620 v2OptNoSimplO214002800420056007000SE +/- 23.42, N = 3SE +/- 23.90, N = 3SE +/- 15.62, N = 3SE +/- 13.11, N = 3SE +/- 21.46, N = 3SE +/- 8.99, N = 3SE +/- 11.45, N = 3SE +/- 20.27, N = 3SE +/- 5.39, N = 3SE +/- 4.50, N = 3SE +/- 10.55, N = 3SE +/- 9.08, N = 3SE +/- 15.10, N = 3SE +/- 262.37, N = 9SE +/- 7.27, N = 3SE +/- 8.82, N = 3SE +/- 4.28, N = 36732.26724.56718.86717.96717.16716.16707.96707.06703.26697.36697.26694.66688.46092.75634.25618.55603.7

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 64OptPREO3NewGVNO333GVNO2OptRedO2OptSimplO3NoGVNO3NewGVNO2NoGVNO2OptRedO3GVNO3OptSimplO2PessimisticO3OptPREO2debugOptNoSimplO32 x Intel Xeon E5-2620 v2OptNoSimplO23K6K9K12K15KSE +/- 39.26, N = 3SE +/- 39.26, N = 3SE +/- 50.86, N = 3SE +/- 23.46, N = 3SE +/- 82.53, N = 3SE +/- 133.36, N = 3SE +/- 181.67, N = 3SE +/- 70.08, N = 3SE +/- 86.00, N = 3SE +/- 125.58, N = 15SE +/- 136.22, N = 15SE +/- 137.71, N = 15SE +/- 159.78, N = 15SE +/- 26.91, N = 3SE +/- 55.77, N = 3SE +/- 70.91, N = 3SE +/- 41.97, N = 31614116110160351598915981159521590615878158451584215782156111538814982137401363613502

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 1024OptSimplO2NoGVNO2OptRedO3NoGVNO3GVNO2NewGVNO2OptRedO2OptSimplO3OptPREO3PessimisticO3OptPREO2GVNO3NewGVNO333OptNoSimplO32 x Intel Xeon E5-2620 v2OptNoSimplO2debug3K6K9K12K15KSE +/- 48.60, N = 3SE +/- 81.04, N = 3SE +/- 15.68, N = 3SE +/- 76.28, N = 3SE +/- 69.17, N = 3SE +/- 63.84, N = 3SE +/- 76.29, N = 3SE +/- 70.39, N = 3SE +/- 74.77, N = 3SE +/- 46.46, N = 3SE +/- 58.30, N = 3SE +/- 43.51, N = 3SE +/- 72.25, N = 3SE +/- 58.86, N = 3SE +/- 40.01, N = 3SE +/- 124.86, N = 3SE +/- 644.68, N = 1212337.012316.012298.012221.012218.012213.012202.012182.012132.012131.012125.012064.011990.011949.011906.011903.010890.7

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 2048PessimisticO3NewGVNO333NoGVNO3OptSimplO3OptRedO3GVNO3NoGVNO2OptSimplO2OptRedO2OptPREO2OptPREO3NewGVNO2GVNO2OptNoSimplO3OptNoSimplO22 x Intel Xeon E5-2620 v2debug15003000450060007500SE +/- 10.65, N = 3SE +/- 2.80, N = 3SE +/- 8.17, N = 3SE +/- 14.30, N = 3SE +/- 28.87, N = 3SE +/- 2.26, N = 3SE +/- 11.97, N = 3SE +/- 15.79, N = 3SE +/- 19.13, N = 3SE +/- 26.38, N = 3SE +/- 43.40, N = 3SE +/- 12.08, N = 3SE +/- 18.93, N = 3SE +/- 25.37, N = 3SE +/- 33.67, N = 3SE +/- 41.03, N = 3SE +/- 44.49, N = 37085.37080.87077.77072.17059.07049.27027.07024.07021.67005.56992.76988.06983.66731.16728.26665.96459.1

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 128OptNoSimplO32 x Intel Xeon E5-2620 v2OptNoSimplO2NewGVNO333NoGVNO3OptRedO3OptSimplO3PessimisticO3NewGVNO2GVNO3OptPREO2OptPREO3NoGVNO2GVNO2debugOptRedO2OptSimplO23K6K9K12K15KSE +/- 50.90, N = 3SE +/- 82.96, N = 15SE +/- 113.85, N = 15SE +/- 86.04, N = 3SE +/- 92.80, N = 3SE +/- 104.94, N = 8SE +/- 47.40, N = 3SE +/- 101.06, N = 3SE +/- 91.43, N = 10SE +/- 105.79, N = 7SE +/- 94.72, N = 15SE +/- 147.41, N = 3SE +/- 95.52, N = 3SE +/- 95.45, N = 3SE +/- 84.15, N = 12SE +/- 48.56, N = 3SE +/- 111.70, N = 31259112295121881199111952119431193411820117961179511788117811174311739117291160911578

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 32NewGVNO2OptPREO2NoGVNO3NewGVNO333NoGVNO2debugOptRedO2GVNO3GVNO2OptRedO3PessimisticO3OptPREO3OptSimplO2OptSimplO32 x Intel Xeon E5-2620 v2OptNoSimplO3OptNoSimplO216003200480064008000SE +/- 12.13, N = 3SE +/- 98.33, N = 3SE +/- 35.65, N = 3SE +/- 72.31, N = 3SE +/- 79.17, N = 3SE +/- 5.79, N = 3SE +/- 76.45, N = 5SE +/- 82.98, N = 4SE +/- 65.67, N = 7SE +/- 57.27, N = 15SE +/- 56.02, N = 15SE +/- 54.90, N = 15SE +/- 52.52, N = 15SE +/- 90.54, N = 15SE +/- 64.34, N = 15SE +/- 71.46, N = 15SE +/- 24.17, N = 37397.67355.97347.47316.47311.47287.07252.47243.17205.17194.97166.17126.77116.07041.36925.76912.76805.0

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 64NoGVNO3NewGVNO2PessimisticO3OptRedO3GVNO3OptPREO2OptRedO2NoGVNO2OptSimplO3OptPREO3NewGVNO333OptSimplO22 x Intel Xeon E5-2620 v2OptNoSimplO2GVNO2OptNoSimplO3debug2K4K6K8K10KSE +/- 114.62, N = 3SE +/- 85.65, N = 3SE +/- 10.59, N = 3SE +/- 18.11, N = 3SE +/- 40.83, N = 3SE +/- 69.65, N = 13SE +/- 18.19, N = 3SE +/- 99.10, N = 3SE +/- 15.38, N = 3SE +/- 7.77, N = 3SE +/- 101.28, N = 5SE +/- 141.95, N = 3SE +/- 54.50, N = 3SE +/- 33.87, N = 3SE +/- 144.31, N = 15SE +/- 98.25, N = 3SE +/- 172.96, N = 1510470.010418.010410.010397.010372.010357.810354.010350.010321.010315.010279.510256.010229.010131.09996.39964.49870.6

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 256OptSimplO2OptPREO3OptPREO2NewGVNO2OptRedO2GVNO3GVNO2NoGVNO3NewGVNO333OptRedO3NoGVNO2OptSimplO32 x Intel Xeon E5-2620 v2OptNoSimplO3PessimisticO3debugOptNoSimplO23K6K9K12K15KSE +/- 115.02, N = 3SE +/- 127.87, N = 3SE +/- 145.87, N = 3SE +/- 71.88, N = 3SE +/- 145.93, N = 4SE +/- 169.36, N = 3SE +/- 150.68, N = 4SE +/- 28.45, N = 3SE +/- 146.24, N = 4SE +/- 165.24, N = 3SE +/- 30.75, N = 3SE +/- 36.46, N = 3SE +/- 67.57, N = 3SE +/- 58.29, N = 3SE +/- 101.06, N = 3SE +/- 98.34, N = 3SE +/- 68.09, N = 31445514266142471415014125141241404414032140091399713989139831383113795137941367913651

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 1024debugNewGVNO2PessimisticO3OptRedO3GVNO2OptPREO3NoGVNO2GVNO3OptRedO2OptPREO2NoGVNO3OptSimplO3OptSimplO2NewGVNO333OptNoSimplO2OptNoSimplO32 x Intel Xeon E5-2620 v23K6K9K12K15KSE +/- 80.85, N = 3SE +/- 75.24, N = 3SE +/- 78.81, N = 3SE +/- 94.45, N = 3SE +/- 78.73, N = 3SE +/- 21.01, N = 3SE +/- 67.72, N = 3SE +/- 15.17, N = 3SE +/- 85.56, N = 3SE +/- 82.71, N = 3SE +/- 80.68, N = 3SE +/- 141.62, N = 3SE +/- 73.63, N = 3SE +/- 129.47, N = 3SE +/- 43.66, N = 3SE +/- 32.22, N = 3SE +/- 112.78, N = 31626616234162231616916167161431614116119160991609016059160371596215939156351563015517

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 1024OptRedO3GVNO2NewGVNO2PessimisticO3OptPREO2OptRedO2debugGVNO3NewGVNO333OptSimplO2NoGVNO3NoGVNO2OptPREO3OptSimplO3OptNoSimplO22 x Intel Xeon E5-2620 v2OptNoSimplO37001400210028003500SE +/- 9.39, N = 3SE +/- 5.57, N = 3SE +/- 18.27, N = 3SE +/- 1.11, N = 3SE +/- 8.38, N = 3SE +/- 14.89, N = 3SE +/- 15.08, N = 3SE +/- 12.60, N = 3SE +/- 24.45, N = 3SE +/- 26.70, N = 3SE +/- 38.36, N = 3SE +/- 13.41, N = 3SE +/- 15.98, N = 3SE +/- 10.08, N = 3SE +/- 12.20, N = 3SE +/- 13.57, N = 3SE +/- 8.28, N = 33149.13142.03138.13133.03126.03123.43123.13120.23114.73110.33107.23091.63085.73083.83053.13048.53018.7

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 128OptSimplO2NewGVNO333NoGVNO3OptPREO2GVNO3OptPREO3OptRedO2OptSimplO3NoGVNO2PessimisticO3OptRedO3NewGVNO2OptNoSimplO3GVNO2OptNoSimplO22 x Intel Xeon E5-2620 v2debug3K6K9K12K15KSE +/- 45.94, N = 3SE +/- 46.51, N = 3SE +/- 33.80, N = 3SE +/- 132.00, N = 5SE +/- 52.35, N = 3SE +/- 120.91, N = 3SE +/- 133.00, N = 3SE +/- 130.44, N = 3SE +/- 160.95, N = 3SE +/- 90.08, N = 3SE +/- 67.87, N = 3SE +/- 169.95, N = 3SE +/- 80.49, N = 3SE +/- 79.22, N = 3SE +/- 84.00, N = 3SE +/- 136.77, N = 3SE +/- 136.03, N = 151324513219132061316713146131431307213058130261301913000129081285012831127891274012702

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 256NewGVNO333NoGVNO3OptPREO3PessimisticO3GVNO3OptPREO2OptSimplO2debugGVNO2NewGVNO2NoGVNO2OptSimplO3OptRedO2OptRedO3OptNoSimplO3OptNoSimplO22 x Intel Xeon E5-2620 v22K4K6K8K10KSE +/- 65.80, N = 3SE +/- 40.95, N = 3SE +/- 60.95, N = 3SE +/- 62.20, N = 3SE +/- 34.04, N = 3SE +/- 55.86, N = 3SE +/- 35.14, N = 3SE +/- 86.71, N = 3SE +/- 40.76, N = 3SE +/- 84.51, N = 3SE +/- 85.30, N = 3SE +/- 91.42, N = 3SE +/- 108.39, N = 3SE +/- 140.84, N = 3SE +/- 21.39, N = 3SE +/- 115.75, N = 3SE +/- 119.21, N = 31154811511114821148011467114651144011414114111140811399113781135611337111371109411087

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 2048NoGVNO3OptRedO2NewGVNO2PessimisticO3OptSimplO2OptSimplO3NoGVNO2NewGVNO333GVNO2OptPREO2OptRedO3debugGVNO3OptPREO3OptNoSimplO3OptNoSimplO22 x Intel Xeon E5-2620 v26001200180024003000SE +/- 12.56, N = 3SE +/- 16.99, N = 3SE +/- 19.22, N = 3SE +/- 5.76, N = 3SE +/- 17.93, N = 3SE +/- 16.84, N = 3SE +/- 8.21, N = 3SE +/- 6.49, N = 3SE +/- 13.80, N = 3SE +/- 13.94, N = 3SE +/- 12.30, N = 3SE +/- 3.24, N = 3SE +/- 18.48, N = 3SE +/- 11.61, N = 3SE +/- 6.56, N = 3SE +/- 6.15, N = 3SE +/- 5.49, N = 32736.02728.12727.82726.82726.62725.02723.72718.02716.22710.82707.22706.62705.02700.42664.32662.02642.1

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 64OptNoSimplO3GVNO3OptRedO3OptNoSimplO2NoGVNO22 x Intel Xeon E5-2620 v2OptSimplO3OptPREO3OptRedO2OptSimplO2NewGVNO2debugNoGVNO3OptPREO2PessimisticO3NewGVNO333GVNO29001800270036004500SE +/- 7.74, N = 3SE +/- 2.62, N = 3SE +/- 18.00, N = 3SE +/- 4.58, N = 3SE +/- 4.40, N = 3SE +/- 5.04, N = 3SE +/- 21.71, N = 3SE +/- 3.34, N = 3SE +/- 22.79, N = 3SE +/- 4.27, N = 3SE +/- 22.90, N = 3SE +/- 21.54, N = 3SE +/- 46.52, N = 4SE +/- 61.56, N = 3SE +/- 60.71, N = 3SE +/- 60.33, N = 3SE +/- 51.49, N = 44354.74347.44337.34337.14336.74336.64326.34321.44320.84318.64316.44309.24289.84286.44280.94265.14206.7

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 4096NewGVNO2NewGVNO333OptRedO3PessimisticO3GVNO2NoGVNO3OptSimplO2OptNoSimplO3GVNO3OptPREO22 x Intel Xeon E5-2620 v2OptNoSimplO2OptRedO2NoGVNO2OptSimplO3debugOptPREO33K6K9K12K15KSE +/- 56.04, N = 3SE +/- 25.71, N = 3SE +/- 94.63, N = 3SE +/- 33.27, N = 3SE +/- 46.84, N = 3SE +/- 135.86, N = 3SE +/- 123.09, N = 3SE +/- 55.79, N = 3SE +/- 41.07, N = 3SE +/- 55.00, N = 3SE +/- 82.03, N = 3SE +/- 19.46, N = 3SE +/- 62.13, N = 3SE +/- 80.21, N = 3SE +/- 94.10, N = 3SE +/- 193.73, N = 3SE +/- 89.82, N = 31443814434144301437514363143531433714317142891427914259142431423414186141851409113979

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 128PessimisticO3GVNO2OptPREO3debugOptSimplO3OptRedO2OptRedO3NoGVNO2OptSimplO2NewGVNO2OptPREO2NoGVNO3OptNoSimplO3OptNoSimplO22 x Intel Xeon E5-2620 v2GVNO3NewGVNO33310002000300040005000SE +/- 1.58, N = 3SE +/- 1.52, N = 3SE +/- 9.28, N = 3SE +/- 6.43, N = 3SE +/- 2.14, N = 3SE +/- 4.33, N = 3SE +/- 2.25, N = 3SE +/- 2.46, N = 3SE +/- 2.45, N = 3SE +/- 5.65, N = 3SE +/- 4.02, N = 3SE +/- 35.10, N = 9SE +/- 2.94, N = 3SE +/- 1.43, N = 3SE +/- 3.10, N = 3SE +/- 52.07, N = 15SE +/- 51.58, N = 154434.74433.24427.24420.64417.14415.54415.34414.64408.44405.04402.84379.54364.24358.84354.34327.34294.0

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 512GVNO2NewGVNO2GVNO3OptSimplO3NoGVNO2debugNewGVNO333OptPREO2PessimisticO3NoGVNO3OptSimplO2OptRedO3OptNoSimplO3OptRedO2OptNoSimplO22 x Intel Xeon E5-2620 v2OptPREO33K6K9K12K15KSE +/- 89.29, N = 3SE +/- 71.36, N = 3SE +/- 109.69, N = 3SE +/- 63.39, N = 3SE +/- 81.91, N = 3SE +/- 123.58, N = 3SE +/- 163.42, N = 3SE +/- 127.81, N = 3SE +/- 168.49, N = 3SE +/- 120.34, N = 3SE +/- 60.73, N = 3SE +/- 104.45, N = 3SE +/- 66.70, N = 3SE +/- 195.78, N = 3SE +/- 157.18, N = 3SE +/- 127.67, N = 3SE +/- 45.83, N = 31570415612155761557415560155521554515538155251547115414154041540015294152891527015210

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 512NoGVNO3OptPREO3OptRedO3PessimisticO3debugOptRedO2NewGVNO333OptPREO2NoGVNO2NewGVNO2GVNO2GVNO3OptSimplO3OptNoSimplO3OptSimplO22 x Intel Xeon E5-2620 v2OptNoSimplO28001600240032004000SE +/- 10.93, N = 3SE +/- 3.13, N = 3SE +/- 7.10, N = 3SE +/- 21.49, N = 3SE +/- 10.97, N = 3SE +/- 6.77, N = 3SE +/- 14.05, N = 3SE +/- 16.55, N = 3SE +/- 13.13, N = 3SE +/- 20.08, N = 3SE +/- 5.58, N = 3SE +/- 6.57, N = 3SE +/- 9.69, N = 3SE +/- 14.36, N = 3SE +/- 34.42, N = 12SE +/- 41.40, N = 5SE +/- 38.03, N = 53797.53785.13784.43780.83780.73780.63776.93775.93773.33769.93767.93761.93759.43751.13748.43736.53679.2

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 256NoGVNO3NoGVNO2OptSimplO3OptNoSimplO2debugOptRedO3GVNO3NewGVNO2OptSimplO2OptPREO22 x Intel Xeon E5-2620 v2OptRedO2OptNoSimplO3NewGVNO333PessimisticO3GVNO2OptPREO38001600240032004000SE +/- 7.11, N = 3SE +/- 13.35, N = 3SE +/- 34.82, N = 3SE +/- 17.09, N = 3SE +/- 23.34, N = 3SE +/- 33.79, N = 3SE +/- 12.33, N = 3SE +/- 33.28, N = 3SE +/- 34.93, N = 3SE +/- 12.43, N = 3SE +/- 23.89, N = 3SE +/- 16.18, N = 3SE +/- 37.88, N = 3SE +/- 23.72, N = 3SE +/- 50.06, N = 3SE +/- 47.63, N = 3SE +/- 35.49, N = 33693.33686.73673.03671.33660.63655.83654.93648.03642.83641.63634.93622.23616.93610.33603.63589.53583.5

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 1024GVNO3OptPREO3NewGVNO333OptSimplO3debugNoGVNO2PessimisticO3OptRedO2OptSimplO2NoGVNO3OptPREO2NewGVNO2OptRedO3OptNoSimplO2GVNO2OptNoSimplO32 x Intel Xeon E5-2620 v29001800270036004500SE +/- 8.12, N = 3SE +/- 4.55, N = 3SE +/- 7.01, N = 3SE +/- 4.50, N = 3SE +/- 5.54, N = 3SE +/- 2.87, N = 3SE +/- 10.52, N = 3SE +/- 4.77, N = 3SE +/- 11.99, N = 3SE +/- 19.30, N = 3SE +/- 14.59, N = 3SE +/- 9.92, N = 3SE +/- 32.58, N = 3SE +/- 8.01, N = 3SE +/- 48.65, N = 4SE +/- 25.83, N = 3SE +/- 59.35, N = 34290.64288.44286.74286.34285.64282.94282.54280.94276.04270.04267.94263.24252.94233.74232.84198.14178.4

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 322 x Intel Xeon E5-2620 v2NewGVNO333NoGVNO3GVNO3OptNoSimplO2OptPREO3NewGVNO2OptPREO2debugGVNO2OptRedO3PessimisticO3NoGVNO2OptSimplO3OptSimplO2OptRedO2OptNoSimplO311002200330044005500SE +/- 4.74, N = 3SE +/- 2.77, N = 3SE +/- 23.10, N = 3SE +/- 26.35, N = 3SE +/- 17.04, N = 3SE +/- 18.68, N = 3SE +/- 31.02, N = 3SE +/- 15.07, N = 3SE +/- 29.89, N = 3SE +/- 13.08, N = 3SE +/- 13.97, N = 3SE +/- 15.16, N = 3SE +/- 8.35, N = 3SE +/- 39.97, N = 3SE +/- 25.51, N = 3SE +/- 31.02, N = 3SE +/- 95.20, N = 135024.65021.35020.85019.85008.95008.85005.75000.04998.54997.44990.14979.94967.14963.84962.74957.94898.0

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 256OptRedO3NewGVNO3332 x Intel Xeon E5-2620 v2OptPREO2GVNO3OptNoSimplO3OptNoSimplO2NewGVNO2NoGVNO2GVNO2debugNoGVNO3PessimisticO3OptRedO2OptSimplO2OptSimplO3OptPREO39001800270036004500SE +/- 17.30, N = 3SE +/- 17.44, N = 3SE +/- 27.37, N = 3SE +/- 3.18, N = 3SE +/- 19.87, N = 3SE +/- 24.49, N = 3SE +/- 1.60, N = 3SE +/- 5.17, N = 3SE +/- 1.57, N = 3SE +/- 24.27, N = 3SE +/- 1.72, N = 3SE +/- 15.53, N = 3SE +/- 10.82, N = 3SE +/- 22.07, N = 3SE +/- 8.48, N = 3SE +/- 53.04, N = 3SE +/- 43.31, N = 34329.54317.04316.14314.34313.44312.84309.24308.34308.34300.94296.24290.54289.24288.14275.64247.74241.6

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 512OptPREO3debugOptSimplO3GVNO2OptRedO2PessimisticO3OptPREO2GVNO3NewGVNO333NoGVNO2OptSimplO2OptNoSimplO3NoGVNO32 x Intel Xeon E5-2620 v2OptNoSimplO2OptRedO3NewGVNO23K6K9K12K15KSE +/- 57.89, N = 3SE +/- 97.11, N = 3SE +/- 36.67, N = 3SE +/- 40.29, N = 3SE +/- 93.72, N = 3SE +/- 60.34, N = 3SE +/- 83.95, N = 3SE +/- 52.21, N = 3SE +/- 41.82, N = 3SE +/- 84.01, N = 3SE +/- 128.25, N = 3SE +/- 61.50, N = 3SE +/- 19.97, N = 3SE +/- 37.37, N = 3SE +/- 64.54, N = 3SE +/- 39.68, N = 3SE +/- 64.51, N = 31189311870118581184111836118211182111818118031180311780117781175011715117141170911652

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 32OptRedO3NoGVNO3debugOptSimplO2OptSimplO3NewGVNO2NoGVNO2OptPREO3OptRedO2NewGVNO333GVNO2GVNO3PessimisticO3OptPREO2OptNoSimplO3OptNoSimplO22 x Intel Xeon E5-2620 v211002200330044005500SE +/- 11.16, N = 3SE +/- 6.35, N = 3SE +/- 7.51, N = 3SE +/- 6.97, N = 3SE +/- 12.54, N = 3SE +/- 1.29, N = 3SE +/- 12.52, N = 3SE +/- 5.69, N = 3SE +/- 32.60, N = 3SE +/- 11.03, N = 3SE +/- 33.03, N = 3SE +/- 39.92, N = 3SE +/- 36.57, N = 3SE +/- 55.21, N = 3SE +/- 41.81, N = 3SE +/- 43.81, N = 3SE +/- 6.03, N = 35220.55219.05218.75217.95213.45213.25211.55209.05189.35180.45179.15176.45176.15144.85144.65127.95115.0

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 512OptRedO3OptPREO2GVNO3PessimisticO3OptSimplO3NoGVNO3NoGVNO2OptNoSimplO2OptRedO2OptPREO3NewGVNO2OptSimplO2NewGVNO333debugOptNoSimplO3GVNO22 x Intel Xeon E5-2620 v29001800270036004500SE +/- 10.85, N = 3SE +/- 8.76, N = 3SE +/- 15.02, N = 3SE +/- 16.20, N = 3SE +/- 4.56, N = 3SE +/- 15.04, N = 3SE +/- 12.50, N = 3SE +/- 2.84, N = 3SE +/- 3.08, N = 3SE +/- 13.39, N = 3SE +/- 7.37, N = 3SE +/- 27.65, N = 3SE +/- 10.16, N = 3SE +/- 16.10, N = 3SE +/- 11.47, N = 3SE +/- 23.58, N = 3SE +/- 22.16, N = 34367.84366.84352.14350.44346.84345.74344.44340.14340.04338.54334.24334.24332.94320.44314.44298.44285.8

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 2048PessimisticO3debugNoGVNO3NewGVNO333NewGVNO2NoGVNO2OptRedO2OptNoSimplO22 x Intel Xeon E5-2620 v2OptPREO2GVNO3OptSimplO3OptRedO3OptPREO3OptNoSimplO3OptSimplO2GVNO29001800270036004500SE +/- 9.60, N = 3SE +/- 6.96, N = 3SE +/- 8.12, N = 3SE +/- 8.15, N = 3SE +/- 12.48, N = 3SE +/- 6.84, N = 3SE +/- 14.17, N = 3SE +/- 18.65, N = 3SE +/- 11.79, N = 3SE +/- 4.44, N = 3SE +/- 14.25, N = 3SE +/- 7.07, N = 3SE +/- 22.16, N = 3SE +/- 16.83, N = 3SE +/- 6.58, N = 3SE +/- 14.04, N = 3SE +/- 47.88, N = 44063.84055.64055.04053.44051.54051.34048.34046.94046.44045.54042.04039.54037.24034.44031.34030.63989.4

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 2048OptSimplO3NewGVNO333OptRedO2NewGVNO2OptNoSimplO3OptNoSimplO22 x Intel Xeon E5-2620 v2OptRedO3OptSimplO2debugPessimisticO3NoGVNO2NoGVNO3GVNO2GVNO3OptPREO2OptPREO33K6K9K12K15KSE +/- 27.28, N = 3SE +/- 20.01, N = 3SE +/- 58.95, N = 3SE +/- 42.13, N = 3SE +/- 15.63, N = 3SE +/- 24.95, N = 3SE +/- 107.49, N = 3SE +/- 33.86, N = 3SE +/- 37.49, N = 3SE +/- 41.31, N = 3SE +/- 97.36, N = 3SE +/- 75.96, N = 3SE +/- 78.24, N = 3SE +/- 20.00, N = 3SE +/- 76.47, N = 3SE +/- 53.18, N = 3SE +/- 16.00, N = 31563115571155681556415551155431553915538155261551315481154611545715428154261540715389

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 128NoGVNO3OptPREO2debug2 x Intel Xeon E5-2620 v2OptNoSimplO3OptPREO3OptSimplO2OptSimplO3GVNO2NoGVNO2NewGVNO2OptRedO2NewGVNO333OptRedO3OptNoSimplO2PessimisticO3GVNO38001600240032004000SE +/- 3.71, N = 3SE +/- 11.59, N = 3SE +/- 4.06, N = 3SE +/- 9.25, N = 3SE +/- 21.66, N = 3SE +/- 29.76, N = 3SE +/- 18.17, N = 3SE +/- 4.22, N = 3SE +/- 21.71, N = 3SE +/- 7.88, N = 3SE +/- 17.74, N = 3SE +/- 14.94, N = 3SE +/- 30.58, N = 3SE +/- 28.74, N = 3SE +/- 31.53, N = 3SE +/- 29.21, N = 3SE +/- 4.10, N = 33861.63855.93855.83848.03839.43837.33836.53836.43836.23835.53834.93831.93824.53821.53812.53812.33805.3

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 4096GVNO3GVNO2PessimisticO3NewGVNO333OptPREO2NewGVNO2OptRedO2OptRedO3OptPREO3OptSimplO3OptNoSimplO3debugNoGVNO3OptSimplO2OptNoSimplO22 x Intel Xeon E5-2620 v2NoGVNO28001600240032004000SE +/- 8.49, N = 3SE +/- 9.96, N = 3SE +/- 5.80, N = 3SE +/- 0.15, N = 3SE +/- 11.64, N = 3SE +/- 4.58, N = 3SE +/- 4.75, N = 3SE +/- 5.34, N = 3SE +/- 4.73, N = 3SE +/- 15.54, N = 3SE +/- 13.95, N = 3SE +/- 17.36, N = 3SE +/- 13.90, N = 3SE +/- 7.30, N = 3SE +/- 1.10, N = 3SE +/- 8.30, N = 3SE +/- 8.63, N = 33950.03936.03934.63934.43931.33924.23923.93921.73920.73919.33917.43914.83912.13911.53909.53907.03895.4

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 64PessimisticO3OptSimplO2NoGVNO2debugOptSimplO3OptPREO2OptNoSimplO3GVNO32 x Intel Xeon E5-2620 v2OptRedO3NoGVNO3NewGVNO2OptPREO3OptRedO2NewGVNO333GVNO2OptNoSimplO211002200330044005500SE +/- 4.79, N = 3SE +/- 6.01, N = 3SE +/- 3.97, N = 3SE +/- 2.78, N = 3SE +/- 3.52, N = 3SE +/- 11.20, N = 3SE +/- 4.42, N = 3SE +/- 6.59, N = 3SE +/- 7.41, N = 3SE +/- 8.65, N = 3SE +/- 5.21, N = 3SE +/- 2.54, N = 3SE +/- 10.53, N = 3SE +/- 7.40, N = 3SE +/- 0.13, N = 3SE +/- 14.95, N = 3SE +/- 61.09, N = 34914.14911.44907.74906.84905.74902.94899.94899.64899.54899.14897.94894.24893.84892.04891.04885.04846.9