fftw-1.2.0run

2 x Intel Xeon E5-2620 v2 testing with a ASUS Z9PE-D8 WS (5503 BIOS) and ASPEED on CentOS Stream 9 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2410236-NE-FFTW120RU61
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
debug
July 13
  1 Hour, 54 Minutes
NoGVNO2
July 14
  57 Minutes
NoGVNO3
July 15
  57 Minutes
OptNoSimplO2
July 16
  17 Minutes
2 x Intel Xeon E5-2620 v2
July 17
  17 Minutes
OptNoSimplO3
July 19
  39 Minutes
OptSimplO2
July 21
  59 Minutes
OptSimplO3
July 22
  58 Minutes
OptRedO2
July 27
  57 Minutes
OptRedO3
July 28
  58 Minutes
OptPREO2
August 06
  59 Minutes
GVNO2
August 07
  58 Minutes
NewGVNO2
August 09
  58 Minutes
OptPREO3
August 12
  58 Minutes
NewGVNO333
August 28
  58 Minutes
GVNO3
August 29
  59 Minutes
PessimisticO3
September 04
  58 Minutes
PessimisticO2
September 05
  57 Minutes
PessimisticNewGVNO2
September 06
  57 Minutes
PessimisticNewGVNO3
September 07
  59 Minutes
NewGVNO2-debug
October 23
  50 Minutes
Invert Hiding All Results Option
  55 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


fftw-1.2.0runProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkMonitorOSKernelDisplay ServerDisplay DriverCompilerFile-SystemScreen ResolutiondebugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v2OptNoSimplO3OptSimplO2OptSimplO3OptRedO2OptRedO3OptPREO2GVNO2NewGVNO2OptPREO3NewGVNO333GVNO3PessimisticO3PessimisticO2PessimisticNewGVNO2PessimisticNewGVNO3NewGVNO2-debug2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads)ASUS Z9PE-D8 WS (5503 BIOS)Intel Xeon E7 v2/Xeon32GB256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00PASPEEDRealtek ALC8982 x Intel 82574LCentOS Stream 95.14.0-467.el9.x86_64 (x86_64)X ServerNVIDIAGCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2ext41024x7682000GB Western Digital WD20EARX-00P + 256GB Samsung SSD 850ASUS VW1905.14.0-474.el9.x86_64 (x86_64)5.14.0-480.el9.x86_64 (x86_64)256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00P5.14.0-496.el9.x86_64 (x86_64)GCC 11.5.0 20240719 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.25.14.0-503.el9.x86_64 (x86_64)5.14.0-514.el9.x86_64 (x86_64)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: alwaysEnvironment Details- debug: CXXFLAGS=-O2 CFLAGS=-O2- NoGVNO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -disable-gvn=true" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -disable-gvn=true"- NoGVNO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -disable-gvn=true" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -disable-gvn=true"- OptNoSimplO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false"- 2 x Intel Xeon E5-2620 v2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false"- OptNoSimplO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false"- OptSimplO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true"- OptSimplO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true"- OptRedO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false"- OptRedO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false"- OptPREO2: CXXFLAGS="-O2 -mllvm -enable-newgvn" CFLAGS="-O2 -mllvm -enable-newgvn"- GVNO2: CXXFLAGS=-O2 CFLAGS=-O2- NewGVNO2: CXXFLAGS="-O2 -mllvm -enable-newgvn" CFLAGS="-O2 -mllvm -enable-newgvn"- OptPREO3: CXXFLAGS="-O3 -mllvm -enable-newgvn" CFLAGS="-O3 -mllvm -enable-newgvn"- NewGVNO333: CXXFLAGS="-O3 -mllvm -enable-newgvn" CFLAGS="-O3 -mllvm -enable-newgvn"- GVNO3: CXXFLAGS=-O3 CFLAGS=-O3- PessimisticO3: CXXFLAGS=-O3 CFLAGS=-O3- PessimisticO2: CXXFLAGS=-O2 CFLAGS=-O2- PessimisticNewGVNO2: CXXFLAGS="-O2 -mllvm -enable-newgvn" CFLAGS="-O2 -mllvm -enable-newgvn"- PessimisticNewGVNO3: CXXFLAGS="-O3 -mllvm -enable-newgvn" CFLAGS="-O3 -mllvm -enable-newgvn"- NewGVNO2-debug: CXXFLAGS="-O2 -mllvm -enable-newgvn" CFLAGS="-O2 -mllvm -enable-newgvn"Compiler Details- Optimized build with assertions; Built Apr 11 2013 (07:43:48); Default target: i386-pc-linux-gnu; Host CPU: i686Processor Details- Scaling Governor: intel_cpufreq conservative - CPU Microcode: 0x42eSecurity Details- gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Unknown: No mitigations + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; IBRS_FW; STIBP: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v2OptNoSimplO3OptSimplO2OptSimplO3OptRedO2OptRedO3OptPREO2GVNO2NewGVNO2OptPREO3NewGVNO333GVNO3PessimisticO3PessimisticO2PessimisticNewGVNO2PessimisticNewGVNO3NewGVNO2-debugResult OverviewPhoronix Test Suite100%164%229%293%FFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFloat + SSE - 1D FFT Size 1024Float + SSE - 1D FFT Size 2048Float + SSE - 1D FFT Size 512Float + SSE - 1D FFT Size 4096Float + SSE - 1D FFT Size 256Float + SSE - 2D FFT Size 32Float + SSE - 2D FFT Size 64Float + SSE - 2D FFT Size 1024Float + SSE - 1D FFT Size 128Float + SSE - 2D FFT Size 128Float + SSE - 2D FFT Size 512Float + SSE - 2D FFT Size 256Float + SSE - 2D FFT Size 2048Float + SSE - 2D FFT Size 4096Float + SSE - 1D FFT Size 64Float + SSE - 1D FFT Size 32Stock - 2D FFT Size 4096Stock - 2D FFT Size 1024Stock - 2D FFT Size 2048Stock - 2D FFT Size 64Stock - 1D FFT Size 128Stock - 2D FFT Size 512Stock - 2D FFT Size 256Stock - 1D FFT Size 1024Stock - 1D FFT Size 32Stock - 1D FFT Size 256Stock - 1D FFT Size 512Stock - 2D FFT Size 32Stock - 1D FFT Size 2048Stock - 2D FFT Size 128Stock - 1D FFT Size 4096Stock - 1D FFT Size 64

fftw-1.2.0runfftw: Float + SSE - 1D FFT Size 1024fftw: Float + SSE - 1D FFT Size 2048fftw: Float + SSE - 1D FFT Size 512fftw: Float + SSE - 1D FFT Size 4096fftw: Float + SSE - 1D FFT Size 256fftw: Float + SSE - 2D FFT Size 32fftw: Float + SSE - 2D FFT Size 64fftw: Float + SSE - 2D FFT Size 1024fftw: Float + SSE - 1D FFT Size 128fftw: Float + SSE - 2D FFT Size 128fftw: Float + SSE - 2D FFT Size 512fftw: Float + SSE - 2D FFT Size 256fftw: Float + SSE - 2D FFT Size 2048fftw: Float + SSE - 2D FFT Size 4096fftw: Float + SSE - 1D FFT Size 64fftw: Float + SSE - 1D FFT Size 32fftw: Stock - 2D FFT Size 4096fftw: Stock - 2D FFT Size 1024fftw: Stock - 2D FFT Size 2048fftw: Stock - 2D FFT Size 64fftw: Stock - 1D FFT Size 128fftw: Stock - 2D FFT Size 512fftw: Stock - 2D FFT Size 256fftw: Stock - 1D FFT Size 1024fftw: Stock - 1D FFT Size 32fftw: Stock - 1D FFT Size 256fftw: Stock - 1D FFT Size 512fftw: Stock - 2D FFT Size 32fftw: Stock - 1D FFT Size 2048fftw: Stock - 2D FFT Size 128fftw: Stock - 1D FFT Size 4096fftw: Stock - 1D FFT Size 64debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v2OptNoSimplO3OptSimplO2OptSimplO3OptRedO2OptRedO3OptPREO2GVNO2NewGVNO2OptPREO3NewGVNO333GVNO3PessimisticO3PessimisticO2PessimisticNewGVNO2PessimisticNewGVNO3NewGVNO2-debug1626615513155521409113679164401498210890.7127021172911870114146459.16092.79870.67287.02751.43123.12706.64309.24420.63780.73660.64285.64998.54296.24320.45218.74055.63855.83914.84906.81614115461155601418613989165481587812316130261174311803113997027.06717.9103507311.42756.03091.62723.74336.74414.63773.33686.74282.94967.14308.34344.45211.54051.33835.53895.44907.71605915457154711435314032165261595212221132061195211750115117077.76717.1104707347.42738.93107.22736.04289.84379.53797.53693.34270.05020.84290.54345.75219.04055.03861.63912.14897.91563515543152891424313651134711350211903127891218811714110946728.25603.7101316805.02231.13053.12662.04337.14358.83679.23671.34233.75008.94309.24340.15127.94046.93812.53909.54846.91551715539152701425913831133941363611906127401229511715110876665.95618.5102296925.72239.33048.52642.14336.64354.33736.53634.94178.45024.64316.14285.85115.04046.43848.03907.04899.51563015551154001431713795134141374011949128501259111778111376731.15634.29964.46912.72200.03018.72664.34354.74364.23751.13616.94198.14898.04312.84314.45144.64031.33839.43917.44899.91596215526154141433714455164441578212337132451157811780114407024.06694.6102567116.02757.93110.32726.64318.64408.43748.43642.84276.04962.74275.64334.25217.94030.63836.53911.54911.41603715631155741418513983163731598112182130581193411858113787072.16707.9103217041.32758.93083.82725.04326.34417.13759.43673.04286.34963.84247.74346.85213.44039.53836.43919.34905.71609915568152941423414125166681598912202130721160911836113567021.66697.3103547252.42773.93123.42728.14320.84415.53780.63622.24280.94957.94288.14340.05189.34048.33831.93923.94892.01616915538154041443013997165191584512298130001194311709113377059.06718.8103977194.92767.13149.12707.24337.34415.33784.43655.84252.94990.14329.54367.85220.54037.23821.53921.74899.11609015407155381427914247163441538812125131671178811821114657005.56688.410357.87355.92762.43126.02710.84286.44402.83775.93641.64267.95000.04314.34366.85144.84045.53855.93931.34902.91616715428157041436314044166401603512218128311173911841114116983.66703.29996.37205.12755.03142.02716.24206.74433.23767.93589.54232.84997.44300.94298.45179.13989.43836.23936.04885.01623415564156121443814150163441590612213129081179611652114086988.06697.2104187397.62752.93138.12727.84316.44405.03769.93648.04263.25005.74308.34334.25213.24051.53834.93924.24894.21614315389152101397914266163481614112132131431178111893114826992.76707.0103157126.72755.43085.72700.44321.44427.23785.13583.54288.45008.84241.64338.55209.04034.43837.33920.74893.81593915571155451443414009165541611011990132191199111803115487080.86732.210279.57316.42745.83114.72718.04265.14294.03776.93610.34286.75021.34317.04332.95180.44053.43824.53934.44891.01611915426155761428914124163111584212064131461179511818114677049.26724.5103727243.12779.83120.22705.04347.44327.33761.93654.94290.65019.84313.44352.15176.44042.03805.33950.04899.61622315481155251437513794164871561112131130191182011821114807085.36716.1104107166.12774.83133.02726.84280.94434.73780.83603.64282.54979.94289.24350.45176.14063.83812.33934.64914.11597915338154031432214365164481599412129130951157711806114027027.56692.2105987191.32766.53097.42721.24291.04405.53776.03629.74265.55004.94335.84354.25201.14034.33865.13927.14893.41616515367153801429514328165481578712242130311172611693113937001.66718.4105297420.82758.03118.22731.44260.54412.33768.03663.64271.75027.44301.84319.95173.54046.43829.83932.34899.61606515556155081444414091163781587312178132381174611838114347068.96710.810378.07379.92744.43134.62707.54287.64295.73772.53643.34276.95020.04293.24332.45208.74049.13855.43949.14884.44548.84427.54454.74298.54442.75238.95134.14191.74526.94335.84134.84121.43286.83177.75058.95044.32801.53116.82725.84216.34406.13768.53684.24248.95009.94305.04278.35196.94045.03868.53920.94897.4OpenBenchmarking.org

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 1024debugNewGVNO2PessimisticO3OptRedO3GVNO2PessimisticNewGVNO2OptPREO3NoGVNO2GVNO3OptRedO2OptPREO2PessimisticNewGVNO3NoGVNO3OptSimplO3PessimisticO2OptSimplO2NewGVNO333OptNoSimplO2OptNoSimplO32 x Intel Xeon E5-2620 v2NewGVNO2-debug3K6K9K12K15KSE +/- 80.85, N = 3SE +/- 75.24, N = 3SE +/- 78.81, N = 3SE +/- 94.45, N = 3SE +/- 78.73, N = 3SE +/- 64.06, N = 3SE +/- 21.01, N = 3SE +/- 67.72, N = 3SE +/- 15.17, N = 3SE +/- 85.56, N = 3SE +/- 82.71, N = 3SE +/- 86.33, N = 3SE +/- 80.68, N = 3SE +/- 141.62, N = 3SE +/- 135.76, N = 3SE +/- 73.63, N = 3SE +/- 129.47, N = 3SE +/- 43.66, N = 3SE +/- 32.22, N = 3SE +/- 112.78, N = 3SE +/- 9.20, N = 316266.016234.016223.016169.016167.016165.016143.016141.016119.016099.016090.016065.016059.016037.015979.015962.015939.015635.015630.015517.04548.8

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 2048OptSimplO3NewGVNO333OptRedO2NewGVNO2PessimisticNewGVNO3OptNoSimplO3OptNoSimplO22 x Intel Xeon E5-2620 v2OptRedO3OptSimplO2debugPessimisticO3NoGVNO2NoGVNO3GVNO2GVNO3OptPREO2OptPREO3PessimisticNewGVNO2PessimisticO2NewGVNO2-debug3K6K9K12K15KSE +/- 27.28, N = 3SE +/- 20.01, N = 3SE +/- 58.95, N = 3SE +/- 42.13, N = 3SE +/- 17.24, N = 3SE +/- 15.63, N = 3SE +/- 24.95, N = 3SE +/- 107.49, N = 3SE +/- 33.86, N = 3SE +/- 37.49, N = 3SE +/- 41.31, N = 3SE +/- 97.36, N = 3SE +/- 75.96, N = 3SE +/- 78.24, N = 3SE +/- 20.00, N = 3SE +/- 76.47, N = 3SE +/- 53.18, N = 3SE +/- 16.00, N = 3SE +/- 143.99, N = 3SE +/- 130.85, N = 3SE +/- 17.00, N = 315631.015571.015568.015564.015556.015551.015543.015539.015538.015526.015513.015481.015461.015457.015428.015426.015407.015389.015367.015338.04427.5

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 512GVNO2NewGVNO2GVNO3OptSimplO3NoGVNO2debugNewGVNO333OptPREO2PessimisticO3PessimisticNewGVNO3NoGVNO3OptSimplO2OptRedO3PessimisticO2OptNoSimplO3PessimisticNewGVNO2OptRedO2OptNoSimplO22 x Intel Xeon E5-2620 v2OptPREO3NewGVNO2-debug3K6K9K12K15KSE +/- 89.29, N = 3SE +/- 71.36, N = 3SE +/- 109.69, N = 3SE +/- 63.39, N = 3SE +/- 81.91, N = 3SE +/- 123.58, N = 3SE +/- 163.42, N = 3SE +/- 127.81, N = 3SE +/- 168.49, N = 3SE +/- 143.51, N = 3SE +/- 120.34, N = 3SE +/- 60.73, N = 3SE +/- 104.45, N = 3SE +/- 89.79, N = 3SE +/- 66.70, N = 3SE +/- 109.70, N = 3SE +/- 195.78, N = 3SE +/- 157.18, N = 3SE +/- 127.67, N = 3SE +/- 45.83, N = 3SE +/- 27.78, N = 315704.015612.015576.015574.015560.015552.015545.015538.015525.015508.015471.015414.015404.015403.015400.015380.015294.015289.015270.015210.04454.7

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 4096PessimisticNewGVNO3NewGVNO2NewGVNO333OptRedO3PessimisticO3GVNO2NoGVNO3OptSimplO2PessimisticO2OptNoSimplO3PessimisticNewGVNO2GVNO3OptPREO22 x Intel Xeon E5-2620 v2OptNoSimplO2OptRedO2NoGVNO2OptSimplO3debugOptPREO3NewGVNO2-debug3K6K9K12K15KSE +/- 110.96, N = 3SE +/- 56.04, N = 3SE +/- 25.71, N = 3SE +/- 94.63, N = 3SE +/- 33.27, N = 3SE +/- 46.84, N = 3SE +/- 135.86, N = 3SE +/- 123.09, N = 3SE +/- 9.50, N = 3SE +/- 55.79, N = 3SE +/- 111.29, N = 3SE +/- 41.07, N = 3SE +/- 55.00, N = 3SE +/- 82.03, N = 3SE +/- 19.46, N = 3SE +/- 62.13, N = 3SE +/- 80.21, N = 3SE +/- 94.10, N = 3SE +/- 193.73, N = 3SE +/- 89.82, N = 3SE +/- 4.59, N = 314444.014438.014434.014430.014375.014363.014353.014337.014322.014317.014295.014289.014279.014259.014243.014234.014186.014185.014091.013979.04298.5

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 256OptSimplO2PessimisticO2PessimisticNewGVNO2OptPREO3OptPREO2NewGVNO2OptRedO2GVNO3PessimisticNewGVNO3GVNO2NoGVNO3NewGVNO333OptRedO3NoGVNO2OptSimplO32 x Intel Xeon E5-2620 v2OptNoSimplO3PessimisticO3debugOptNoSimplO2NewGVNO2-debug3K6K9K12K15KSE +/- 115.02, N = 3SE +/- 169.66, N = 3SE +/- 137.11, N = 3SE +/- 127.87, N = 3SE +/- 145.87, N = 3SE +/- 71.88, N = 3SE +/- 145.93, N = 4SE +/- 169.36, N = 3SE +/- 95.21, N = 13SE +/- 150.68, N = 4SE +/- 28.45, N = 3SE +/- 146.24, N = 4SE +/- 165.24, N = 3SE +/- 30.75, N = 3SE +/- 36.46, N = 3SE +/- 67.57, N = 3SE +/- 58.29, N = 3SE +/- 101.06, N = 3SE +/- 98.34, N = 3SE +/- 68.09, N = 3SE +/- 15.23, N = 314455.014365.014328.014266.014247.014150.014125.014124.014091.014044.014032.014009.013997.013989.013983.013831.013795.013794.013679.013651.04442.7

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 32OptRedO2GVNO2NewGVNO333PessimisticNewGVNO2NoGVNO2NoGVNO3OptRedO3PessimisticO3PessimisticO2OptSimplO2debugPessimisticNewGVNO3OptSimplO3OptPREO3NewGVNO2OptPREO2GVNO3OptNoSimplO2OptNoSimplO32 x Intel Xeon E5-2620 v2NewGVNO2-debug4K8K12K16K20KSE +/- 42.15, N = 3SE +/- 87.96, N = 3SE +/- 81.03, N = 3SE +/- 123.12, N = 3SE +/- 136.41, N = 3SE +/- 139.56, N = 3SE +/- 23.39, N = 3SE +/- 72.72, N = 3SE +/- 107.68, N = 3SE +/- 64.02, N = 3SE +/- 98.77, N = 3SE +/- 35.88, N = 3SE +/- 92.64, N = 3SE +/- 33.40, N = 3SE +/- 197.17, N = 3SE +/- 142.90, N = 3SE +/- 194.05, N = 3SE +/- 131.37, N = 3SE +/- 102.53, N = 3SE +/- 107.35, N = 3SE +/- 4.57, N = 316668.016640.016554.016548.016548.016526.016519.016487.016448.016444.016440.016378.016373.016348.016344.016344.016311.013471.013414.013394.05238.9

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 64OptPREO3NewGVNO333GVNO2PessimisticO2OptRedO2OptSimplO3NoGVNO3NewGVNO2NoGVNO2PessimisticNewGVNO3OptRedO3GVNO3PessimisticNewGVNO2OptSimplO2PessimisticO3OptPREO2debugOptNoSimplO32 x Intel Xeon E5-2620 v2OptNoSimplO2NewGVNO2-debug3K6K9K12K15KSE +/- 39.26, N = 3SE +/- 39.26, N = 3SE +/- 50.86, N = 3SE +/- 107.04, N = 3SE +/- 23.46, N = 3SE +/- 82.53, N = 3SE +/- 133.36, N = 3SE +/- 181.67, N = 3SE +/- 70.08, N = 3SE +/- 49.59, N = 3SE +/- 86.00, N = 3SE +/- 125.58, N = 15SE +/- 148.65, N = 7SE +/- 136.22, N = 15SE +/- 137.71, N = 15SE +/- 159.78, N = 15SE +/- 26.91, N = 3SE +/- 55.77, N = 3SE +/- 70.91, N = 3SE +/- 41.97, N = 3SE +/- 10.34, N = 316141.016110.016035.015994.015989.015981.015952.015906.015878.015873.015845.015842.015787.015782.015611.015388.014982.013740.013636.013502.05134.1

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 1024OptSimplO2NoGVNO2OptRedO3PessimisticNewGVNO2NoGVNO3GVNO2NewGVNO2OptRedO2OptSimplO3PessimisticNewGVNO3OptPREO3PessimisticO3PessimisticO2OptPREO2GVNO3NewGVNO333OptNoSimplO32 x Intel Xeon E5-2620 v2OptNoSimplO2debugNewGVNO2-debug3K6K9K12K15KSE +/- 48.60, N = 3SE +/- 81.04, N = 3SE +/- 15.68, N = 3SE +/- 81.50, N = 3SE +/- 76.28, N = 3SE +/- 69.17, N = 3SE +/- 63.84, N = 3SE +/- 76.29, N = 3SE +/- 70.39, N = 3SE +/- 55.07, N = 3SE +/- 74.77, N = 3SE +/- 46.46, N = 3SE +/- 44.50, N = 3SE +/- 58.30, N = 3SE +/- 43.51, N = 3SE +/- 72.25, N = 3SE +/- 58.86, N = 3SE +/- 40.01, N = 3SE +/- 124.86, N = 3SE +/- 644.68, N = 12SE +/- 4.89, N = 312337.012316.012298.012242.012221.012218.012213.012202.012182.012178.012132.012131.012129.012125.012064.011990.011949.011906.011903.010890.74191.7

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 128OptSimplO2PessimisticNewGVNO3NewGVNO333NoGVNO3OptPREO2GVNO3OptPREO3PessimisticO2OptRedO2OptSimplO3PessimisticNewGVNO2NoGVNO2PessimisticO3OptRedO3NewGVNO2OptNoSimplO3GVNO2OptNoSimplO22 x Intel Xeon E5-2620 v2debugNewGVNO2-debug3K6K9K12K15KSE +/- 45.94, N = 3SE +/- 63.14, N = 3SE +/- 46.51, N = 3SE +/- 33.80, N = 3SE +/- 132.00, N = 5SE +/- 52.35, N = 3SE +/- 120.91, N = 3SE +/- 165.41, N = 3SE +/- 133.00, N = 3SE +/- 130.44, N = 3SE +/- 93.99, N = 3SE +/- 160.95, N = 3SE +/- 90.08, N = 3SE +/- 67.87, N = 3SE +/- 169.95, N = 3SE +/- 80.49, N = 3SE +/- 79.22, N = 3SE +/- 84.00, N = 3SE +/- 136.77, N = 3SE +/- 136.03, N = 15SE +/- 0.33, N = 313245.013238.013219.013206.013167.013146.013143.013095.013072.013058.013031.013026.013019.013000.012908.012850.012831.012789.012740.012702.04526.9

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 128OptNoSimplO32 x Intel Xeon E5-2620 v2OptNoSimplO2NewGVNO333NoGVNO3OptRedO3OptSimplO3PessimisticO3NewGVNO2GVNO3OptPREO2OptPREO3PessimisticNewGVNO3NoGVNO2GVNO2debugPessimisticNewGVNO2OptRedO2OptSimplO2PessimisticO2NewGVNO2-debug3K6K9K12K15KSE +/- 50.90, N = 3SE +/- 82.96, N = 15SE +/- 113.85, N = 15SE +/- 86.04, N = 3SE +/- 92.80, N = 3SE +/- 104.94, N = 8SE +/- 47.40, N = 3SE +/- 101.06, N = 3SE +/- 91.43, N = 10SE +/- 105.79, N = 7SE +/- 94.72, N = 15SE +/- 147.41, N = 3SE +/- 118.42, N = 3SE +/- 95.52, N = 3SE +/- 95.45, N = 3SE +/- 84.15, N = 12SE +/- 62.19, N = 3SE +/- 48.56, N = 3SE +/- 111.70, N = 3SE +/- 136.55, N = 3SE +/- 1.05, N = 312591.012295.012188.011991.011952.011943.011934.011820.011796.011795.011788.011781.011746.011743.011739.011729.011726.011609.011578.011577.04335.8

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 512OptPREO3debugOptSimplO3GVNO2PessimisticNewGVNO3OptRedO2PessimisticO3OptPREO2GVNO3PessimisticO2NewGVNO333NoGVNO2OptSimplO2OptNoSimplO3NoGVNO32 x Intel Xeon E5-2620 v2OptNoSimplO2OptRedO3PessimisticNewGVNO2NewGVNO2NewGVNO2-debug3K6K9K12K15KSE +/- 57.89, N = 3SE +/- 97.11, N = 3SE +/- 36.67, N = 3SE +/- 40.29, N = 3SE +/- 127.52, N = 3SE +/- 93.72, N = 3SE +/- 60.34, N = 3SE +/- 83.95, N = 3SE +/- 52.21, N = 3SE +/- 74.64, N = 3SE +/- 41.82, N = 3SE +/- 84.01, N = 3SE +/- 128.25, N = 3SE +/- 61.50, N = 3SE +/- 19.97, N = 3SE +/- 37.37, N = 3SE +/- 64.54, N = 3SE +/- 39.68, N = 3SE +/- 106.71, N = 3SE +/- 64.51, N = 3SE +/- 1.80, N = 311893.011870.011858.011841.011838.011836.011821.011821.011818.011806.011803.011803.011780.011778.011750.011715.011714.011709.011693.011652.04134.8

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 256NewGVNO333NoGVNO3OptPREO3PessimisticO3GVNO3OptPREO2OptSimplO2PessimisticNewGVNO3debugGVNO2NewGVNO2PessimisticO2NoGVNO2PessimisticNewGVNO2OptSimplO3OptRedO2OptRedO3OptNoSimplO3OptNoSimplO22 x Intel Xeon E5-2620 v2NewGVNO2-debug2K4K6K8K10KSE +/- 65.80, N = 3SE +/- 40.95, N = 3SE +/- 60.95, N = 3SE +/- 62.20, N = 3SE +/- 34.04, N = 3SE +/- 55.86, N = 3SE +/- 35.14, N = 3SE +/- 62.20, N = 3SE +/- 86.71, N = 3SE +/- 40.76, N = 3SE +/- 84.51, N = 3SE +/- 73.06, N = 3SE +/- 85.30, N = 3SE +/- 68.55, N = 3SE +/- 91.42, N = 3SE +/- 108.39, N = 3SE +/- 140.84, N = 3SE +/- 21.39, N = 3SE +/- 115.75, N = 3SE +/- 119.21, N = 3SE +/- 11.50, N = 311548.011511.011482.011480.011467.011465.011440.011434.011414.011411.011408.011402.011399.011393.011378.011356.011337.011137.011094.011087.04121.4

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 2048PessimisticO3NewGVNO333NoGVNO3OptSimplO3PessimisticNewGVNO3OptRedO3GVNO3PessimisticO2NoGVNO2OptSimplO2OptRedO2OptPREO2PessimisticNewGVNO2OptPREO3NewGVNO2GVNO2OptNoSimplO3OptNoSimplO22 x Intel Xeon E5-2620 v2debugNewGVNO2-debug15003000450060007500SE +/- 10.65, N = 3SE +/- 2.80, N = 3SE +/- 8.17, N = 3SE +/- 14.30, N = 3SE +/- 14.84, N = 3SE +/- 28.87, N = 3SE +/- 2.26, N = 3SE +/- 24.24, N = 3SE +/- 11.97, N = 3SE +/- 15.79, N = 3SE +/- 19.13, N = 3SE +/- 26.38, N = 3SE +/- 48.51, N = 3SE +/- 43.40, N = 3SE +/- 12.08, N = 3SE +/- 18.93, N = 3SE +/- 25.37, N = 3SE +/- 33.67, N = 3SE +/- 41.03, N = 3SE +/- 44.49, N = 3SE +/- 7.16, N = 37085.37080.87077.77072.17068.97059.07049.27027.57027.07024.07021.67005.57001.66992.76988.06983.66731.16728.26665.96459.13286.8

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096NewGVNO333GVNO3OptRedO3PessimisticNewGVNO2NoGVNO2NoGVNO3PessimisticO3PessimisticNewGVNO3OptSimplO3OptPREO3GVNO2OptRedO2NewGVNO2OptSimplO2PessimisticO2OptPREO2debugOptNoSimplO32 x Intel Xeon E5-2620 v2OptNoSimplO2NewGVNO2-debug14002800420056007000SE +/- 23.42, N = 3SE +/- 23.90, N = 3SE +/- 15.62, N = 3SE +/- 6.00, N = 3SE +/- 13.11, N = 3SE +/- 21.46, N = 3SE +/- 8.99, N = 3SE +/- 15.73, N = 3SE +/- 11.45, N = 3SE +/- 20.27, N = 3SE +/- 5.39, N = 3SE +/- 4.50, N = 3SE +/- 10.55, N = 3SE +/- 9.08, N = 3SE +/- 22.81, N = 3SE +/- 15.10, N = 3SE +/- 262.37, N = 9SE +/- 7.27, N = 3SE +/- 8.82, N = 3SE +/- 4.28, N = 3SE +/- 8.33, N = 36732.26724.56718.86718.46717.96717.16716.16710.86707.96707.06703.26697.36697.26694.66692.26688.46092.75634.25618.55603.73177.7

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 64PessimisticO2PessimisticNewGVNO2NoGVNO3NewGVNO2PessimisticO3OptRedO3PessimisticNewGVNO3GVNO3OptPREO2OptRedO2NoGVNO2OptSimplO3OptPREO3NewGVNO333OptSimplO22 x Intel Xeon E5-2620 v2OptNoSimplO2GVNO2OptNoSimplO3debugNewGVNO2-debug2K4K6K8K10KSE +/- 17.46, N = 3SE +/- 31.09, N = 3SE +/- 114.62, N = 3SE +/- 85.65, N = 3SE +/- 10.59, N = 3SE +/- 18.11, N = 3SE +/- 80.37, N = 10SE +/- 40.83, N = 3SE +/- 69.65, N = 13SE +/- 18.19, N = 3SE +/- 99.10, N = 3SE +/- 15.38, N = 3SE +/- 7.77, N = 3SE +/- 101.28, N = 5SE +/- 141.95, N = 3SE +/- 54.50, N = 3SE +/- 33.87, N = 3SE +/- 144.31, N = 15SE +/- 98.25, N = 3SE +/- 172.96, N = 15SE +/- 54.47, N = 310598.010529.010470.010418.010410.010397.010378.010372.010357.810354.010350.010321.010315.010279.510256.010229.010131.09996.39964.49870.65058.9

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 32PessimisticNewGVNO2NewGVNO2PessimisticNewGVNO3OptPREO2NoGVNO3NewGVNO333NoGVNO2debugOptRedO2GVNO3GVNO2OptRedO3PessimisticO2PessimisticO3OptPREO3OptSimplO2OptSimplO32 x Intel Xeon E5-2620 v2OptNoSimplO3OptNoSimplO2NewGVNO2-debug16003200480064008000SE +/- 14.66, N = 3SE +/- 12.13, N = 3SE +/- 12.63, N = 3SE +/- 98.33, N = 3SE +/- 35.65, N = 3SE +/- 72.31, N = 3SE +/- 79.17, N = 3SE +/- 5.79, N = 3SE +/- 76.45, N = 5SE +/- 82.98, N = 4SE +/- 65.67, N = 7SE +/- 57.27, N = 15SE +/- 45.97, N = 3SE +/- 56.02, N = 15SE +/- 54.90, N = 15SE +/- 52.52, N = 15SE +/- 90.54, N = 15SE +/- 64.34, N = 15SE +/- 71.46, N = 15SE +/- 24.17, N = 3SE +/- 8.26, N = 37420.87397.67379.97355.97347.47316.47311.47287.07252.47243.17205.17194.97191.37166.17126.77116.07041.36925.76912.76805.05044.3

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096NewGVNO2-debugGVNO3PessimisticO3OptRedO2OptRedO3PessimisticO2OptPREO2OptSimplO3PessimisticNewGVNO2OptSimplO2NoGVNO2OptPREO3GVNO2NewGVNO2debugNewGVNO333PessimisticNewGVNO3NoGVNO32 x Intel Xeon E5-2620 v2OptNoSimplO2OptNoSimplO36001200180024003000SE +/- 17.44, N = 3SE +/- 19.40, N = 3SE +/- 23.10, N = 3SE +/- 15.05, N = 3SE +/- 13.13, N = 3SE +/- 23.64, N = 3SE +/- 11.48, N = 3SE +/- 5.14, N = 3SE +/- 7.03, N = 3SE +/- 8.89, N = 3SE +/- 14.23, N = 3SE +/- 7.45, N = 3SE +/- 5.66, N = 3SE +/- 8.68, N = 3SE +/- 5.74, N = 3SE +/- 13.80, N = 3SE +/- 10.94, N = 3SE +/- 16.56, N = 3SE +/- 8.09, N = 3SE +/- 16.21, N = 3SE +/- 21.99, N = 152801.52779.82774.82773.92767.12766.52762.42758.92758.02757.92756.02755.42755.02752.92751.42745.82744.42738.92239.32231.12200.0

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 1024OptRedO3GVNO2NewGVNO2PessimisticNewGVNO3PessimisticO3OptPREO2OptRedO2debugGVNO3PessimisticNewGVNO2NewGVNO2-debugNewGVNO333OptSimplO2NoGVNO3PessimisticO2NoGVNO2OptPREO3OptSimplO3OptNoSimplO22 x Intel Xeon E5-2620 v2OptNoSimplO37001400210028003500SE +/- 9.39, N = 3SE +/- 5.57, N = 3SE +/- 18.27, N = 3SE +/- 17.57, N = 3SE +/- 1.11, N = 3SE +/- 8.38, N = 3SE +/- 14.89, N = 3SE +/- 15.08, N = 3SE +/- 12.60, N = 3SE +/- 24.52, N = 3SE +/- 23.65, N = 3SE +/- 24.45, N = 3SE +/- 26.70, N = 3SE +/- 38.36, N = 3SE +/- 31.08, N = 3SE +/- 13.41, N = 3SE +/- 15.98, N = 3SE +/- 10.08, N = 3SE +/- 12.20, N = 3SE +/- 13.57, N = 3SE +/- 8.28, N = 33149.13142.03138.13134.63133.03126.03123.43123.13120.23118.23116.83114.73110.33107.23097.43091.63085.73083.83053.13048.53018.7

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 2048NoGVNO3PessimisticNewGVNO2OptRedO2NewGVNO2PessimisticO3OptSimplO2NewGVNO2-debugOptSimplO3NoGVNO2PessimisticO2NewGVNO333GVNO2OptPREO2PessimisticNewGVNO3OptRedO3debugGVNO3OptPREO3OptNoSimplO3OptNoSimplO22 x Intel Xeon E5-2620 v26001200180024003000SE +/- 12.56, N = 3SE +/- 12.84, N = 3SE +/- 16.99, N = 3SE +/- 19.22, N = 3SE +/- 5.76, N = 3SE +/- 17.93, N = 3SE +/- 3.87, N = 3SE +/- 16.84, N = 3SE +/- 8.21, N = 3SE +/- 14.64, N = 3SE +/- 6.49, N = 3SE +/- 13.80, N = 3SE +/- 13.94, N = 3SE +/- 4.65, N = 3SE +/- 12.30, N = 3SE +/- 3.24, N = 3SE +/- 18.48, N = 3SE +/- 11.61, N = 3SE +/- 6.56, N = 3SE +/- 6.15, N = 3SE +/- 5.49, N = 32736.02731.42728.12727.82726.82726.62725.82725.02723.72721.22718.02716.22710.82707.52707.22706.62705.02700.42664.32662.02642.1

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 64OptNoSimplO3GVNO3OptRedO3OptNoSimplO2NoGVNO22 x Intel Xeon E5-2620 v2OptSimplO3OptPREO3OptRedO2OptSimplO2NewGVNO2debugPessimisticO2NoGVNO3PessimisticNewGVNO3OptPREO2PessimisticO3NewGVNO333PessimisticNewGVNO2NewGVNO2-debugGVNO29001800270036004500SE +/- 7.74, N = 3SE +/- 2.62, N = 3SE +/- 18.00, N = 3SE +/- 4.58, N = 3SE +/- 4.40, N = 3SE +/- 5.04, N = 3SE +/- 21.71, N = 3SE +/- 3.34, N = 3SE +/- 22.79, N = 3SE +/- 4.27, N = 3SE +/- 22.90, N = 3SE +/- 21.54, N = 3SE +/- 52.12, N = 4SE +/- 46.52, N = 4SE +/- 48.59, N = 4SE +/- 61.56, N = 3SE +/- 60.71, N = 3SE +/- 60.33, N = 3SE +/- 56.44, N = 3SE +/- 54.17, N = 3SE +/- 51.49, N = 44354.74347.44337.34337.14336.74336.64326.34321.44320.84318.64316.44309.24291.04289.84287.64286.44280.94265.14260.54216.34206.7

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 128PessimisticO3GVNO2OptPREO3debugOptSimplO3OptRedO2OptRedO3NoGVNO2PessimisticNewGVNO2OptSimplO2NewGVNO2-debugPessimisticO2NewGVNO2OptPREO2NoGVNO3OptNoSimplO3OptNoSimplO22 x Intel Xeon E5-2620 v2GVNO3PessimisticNewGVNO3NewGVNO33310002000300040005000SE +/- 1.58, N = 3SE +/- 1.52, N = 3SE +/- 9.28, N = 3SE +/- 6.43, N = 3SE +/- 2.14, N = 3SE +/- 4.33, N = 3SE +/- 2.25, N = 3SE +/- 2.46, N = 3SE +/- 6.19, N = 3SE +/- 2.45, N = 3SE +/- 2.58, N = 3SE +/- 3.05, N = 3SE +/- 5.65, N = 3SE +/- 4.02, N = 3SE +/- 35.10, N = 9SE +/- 2.94, N = 3SE +/- 1.43, N = 3SE +/- 3.10, N = 3SE +/- 52.07, N = 15SE +/- 36.81, N = 15SE +/- 51.58, N = 154434.74433.24427.24420.64417.14415.54415.34414.64412.34408.44406.14405.54405.04402.84379.54364.24358.84354.34327.34295.74294.0

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 512NoGVNO3OptPREO3OptRedO3PessimisticO3debugOptRedO2NewGVNO333PessimisticO2OptPREO2NoGVNO2PessimisticNewGVNO3NewGVNO2NewGVNO2-debugPessimisticNewGVNO2GVNO2GVNO3OptSimplO3OptNoSimplO3OptSimplO22 x Intel Xeon E5-2620 v2OptNoSimplO28001600240032004000SE +/- 10.93, N = 3SE +/- 3.13, N = 3SE +/- 7.10, N = 3SE +/- 21.49, N = 3SE +/- 10.97, N = 3SE +/- 6.77, N = 3SE +/- 14.05, N = 3SE +/- 11.78, N = 3SE +/- 16.55, N = 3SE +/- 13.13, N = 3SE +/- 2.02, N = 3SE +/- 20.08, N = 3SE +/- 5.82, N = 3SE +/- 14.01, N = 3SE +/- 5.58, N = 3SE +/- 6.57, N = 3SE +/- 9.69, N = 3SE +/- 14.36, N = 3SE +/- 34.42, N = 12SE +/- 41.40, N = 5SE +/- 38.03, N = 53797.53785.13784.43780.83780.73780.63776.93776.03775.93773.33772.53769.93768.53768.03767.93761.93759.43751.13748.43736.53679.2

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 256NoGVNO3NoGVNO2NewGVNO2-debugOptSimplO3OptNoSimplO2PessimisticNewGVNO2debugOptRedO3GVNO3NewGVNO2PessimisticNewGVNO3OptSimplO2OptPREO22 x Intel Xeon E5-2620 v2PessimisticO2OptRedO2OptNoSimplO3NewGVNO333PessimisticO3GVNO2OptPREO38001600240032004000SE +/- 7.11, N = 3SE +/- 13.35, N = 3SE +/- 1.36, N = 3SE +/- 34.82, N = 3SE +/- 17.09, N = 3SE +/- 4.70, N = 3SE +/- 23.34, N = 3SE +/- 33.79, N = 3SE +/- 12.33, N = 3SE +/- 33.28, N = 3SE +/- 11.87, N = 3SE +/- 34.93, N = 3SE +/- 12.43, N = 3SE +/- 23.89, N = 3SE +/- 20.14, N = 3SE +/- 16.18, N = 3SE +/- 37.88, N = 3SE +/- 23.72, N = 3SE +/- 50.06, N = 3SE +/- 47.63, N = 3SE +/- 35.49, N = 33693.33686.73684.23673.03671.33663.63660.63655.83654.93648.03643.33642.83641.63634.93629.73622.23616.93610.33603.63589.53583.5

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 1024GVNO3OptPREO3NewGVNO333OptSimplO3debugNoGVNO2PessimisticO3OptRedO2PessimisticNewGVNO3OptSimplO2PessimisticNewGVNO2NoGVNO3OptPREO2PessimisticO2NewGVNO2OptRedO3NewGVNO2-debugOptNoSimplO2GVNO2OptNoSimplO32 x Intel Xeon E5-2620 v29001800270036004500SE +/- 8.12, N = 3SE +/- 4.55, N = 3SE +/- 7.01, N = 3SE +/- 4.50, N = 3SE +/- 5.54, N = 3SE +/- 2.87, N = 3SE +/- 10.52, N = 3SE +/- 4.77, N = 3SE +/- 28.59, N = 3SE +/- 11.99, N = 3SE +/- 6.86, N = 3SE +/- 19.30, N = 3SE +/- 14.59, N = 3SE +/- 26.82, N = 3SE +/- 9.92, N = 3SE +/- 32.58, N = 3SE +/- 21.17, N = 3SE +/- 8.01, N = 3SE +/- 48.65, N = 4SE +/- 25.83, N = 3SE +/- 59.35, N = 34290.64288.44286.74286.34285.64282.94282.54280.94276.94276.04271.74270.04267.94265.54263.24252.94248.94233.74232.84198.14178.4

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 32PessimisticNewGVNO22 x Intel Xeon E5-2620 v2NewGVNO333NoGVNO3PessimisticNewGVNO3GVNO3NewGVNO2-debugOptNoSimplO2OptPREO3NewGVNO2PessimisticO2OptPREO2debugGVNO2OptRedO3PessimisticO3NoGVNO2OptSimplO3OptSimplO2OptRedO2OptNoSimplO311002200330044005500SE +/- 1.59, N = 3SE +/- 4.74, N = 3SE +/- 2.77, N = 3SE +/- 23.10, N = 3SE +/- 8.67, N = 3SE +/- 26.35, N = 3SE +/- 18.86, N = 3SE +/- 17.04, N = 3SE +/- 18.68, N = 3SE +/- 31.02, N = 3SE +/- 24.70, N = 3SE +/- 15.07, N = 3SE +/- 29.89, N = 3SE +/- 13.08, N = 3SE +/- 13.97, N = 3SE +/- 15.16, N = 3SE +/- 8.35, N = 3SE +/- 39.97, N = 3SE +/- 25.51, N = 3SE +/- 31.02, N = 3SE +/- 95.20, N = 135027.45024.65021.35020.85020.05019.85009.95008.95008.85005.75004.95000.04998.54997.44990.14979.94967.14963.84962.74957.94898.0

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 256PessimisticO2OptRedO3NewGVNO3332 x Intel Xeon E5-2620 v2OptPREO2GVNO3OptNoSimplO3OptNoSimplO2NewGVNO2NoGVNO2NewGVNO2-debugPessimisticNewGVNO2GVNO2debugPessimisticNewGVNO3NoGVNO3PessimisticO3OptRedO2OptSimplO2OptSimplO3OptPREO39001800270036004500SE +/- 12.05, N = 3SE +/- 17.30, N = 3SE +/- 17.44, N = 3SE +/- 27.37, N = 3SE +/- 3.18, N = 3SE +/- 19.87, N = 3SE +/- 24.49, N = 3SE +/- 1.60, N = 3SE +/- 5.17, N = 3SE +/- 1.57, N = 3SE +/- 7.39, N = 3SE +/- 2.74, N = 3SE +/- 24.27, N = 3SE +/- 1.72, N = 3SE +/- 4.79, N = 3SE +/- 15.53, N = 3SE +/- 10.82, N = 3SE +/- 22.07, N = 3SE +/- 8.48, N = 3SE +/- 53.04, N = 3SE +/- 43.31, N = 34335.84329.54317.04316.14314.34313.44312.84309.24308.34308.34305.04301.84300.94296.24293.24290.54289.24288.14275.64247.74241.6

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 512OptRedO3OptPREO2PessimisticO2GVNO3PessimisticO3OptSimplO3NoGVNO3NoGVNO2OptNoSimplO2OptRedO2OptPREO3NewGVNO2OptSimplO2NewGVNO333PessimisticNewGVNO3debugPessimisticNewGVNO2OptNoSimplO3GVNO22 x Intel Xeon E5-2620 v2NewGVNO2-debug9001800270036004500SE +/- 10.85, N = 3SE +/- 8.76, N = 3SE +/- 18.08, N = 3SE +/- 15.02, N = 3SE +/- 16.20, N = 3SE +/- 4.56, N = 3SE +/- 15.04, N = 3SE +/- 12.50, N = 3SE +/- 2.84, N = 3SE +/- 3.08, N = 3SE +/- 13.39, N = 3SE +/- 7.37, N = 3SE +/- 27.65, N = 3SE +/- 10.16, N = 3SE +/- 19.75, N = 3SE +/- 16.10, N = 3SE +/- 32.02, N = 3SE +/- 11.47, N = 3SE +/- 23.58, N = 3SE +/- 22.16, N = 3SE +/- 38.65, N = 34367.84366.84354.24352.14350.44346.84345.74344.44340.14340.04338.54334.24334.24332.94332.44320.44319.94314.44298.44285.84278.3

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 32OptRedO3NoGVNO3debugOptSimplO2OptSimplO3NewGVNO2NoGVNO2OptPREO3PessimisticNewGVNO3PessimisticO2NewGVNO2-debugOptRedO2NewGVNO333GVNO2GVNO3PessimisticO3PessimisticNewGVNO2OptPREO2OptNoSimplO3OptNoSimplO22 x Intel Xeon E5-2620 v211002200330044005500SE +/- 11.16, N = 3SE +/- 6.35, N = 3SE +/- 7.51, N = 3SE +/- 6.97, N = 3SE +/- 12.54, N = 3SE +/- 1.29, N = 3SE +/- 12.52, N = 3SE +/- 5.69, N = 3SE +/- 6.51, N = 3SE +/- 11.17, N = 3SE +/- 27.38, N = 3SE +/- 32.60, N = 3SE +/- 11.03, N = 3SE +/- 33.03, N = 3SE +/- 39.92, N = 3SE +/- 36.57, N = 3SE +/- 33.93, N = 3SE +/- 55.21, N = 3SE +/- 41.81, N = 3SE +/- 43.81, N = 3SE +/- 6.03, N = 35220.55219.05218.75217.95213.45213.25211.55209.05208.75201.15196.95189.35180.45179.15176.45176.15173.55144.85144.65127.95115.0

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 2048PessimisticO3debugNoGVNO3NewGVNO333NewGVNO2NoGVNO2PessimisticNewGVNO3OptRedO2OptNoSimplO2PessimisticNewGVNO22 x Intel Xeon E5-2620 v2OptPREO2NewGVNO2-debugGVNO3OptSimplO3OptRedO3OptPREO3PessimisticO2OptNoSimplO3OptSimplO2GVNO29001800270036004500SE +/- 9.60, N = 3SE +/- 6.96, N = 3SE +/- 8.12, N = 3SE +/- 8.15, N = 3SE +/- 12.48, N = 3SE +/- 6.84, N = 3SE +/- 11.53, N = 3SE +/- 14.17, N = 3SE +/- 18.65, N = 3SE +/- 4.42, N = 3SE +/- 11.79, N = 3SE +/- 4.44, N = 3SE +/- 4.58, N = 3SE +/- 14.25, N = 3SE +/- 7.07, N = 3SE +/- 22.16, N = 3SE +/- 16.83, N = 3SE +/- 6.35, N = 3SE +/- 6.58, N = 3SE +/- 14.04, N = 3SE +/- 47.88, N = 44063.84055.64055.04053.44051.54051.34049.14048.34046.94046.44046.44045.54045.04042.04039.54037.24034.44034.34031.34030.63989.4

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 128NewGVNO2-debugPessimisticO2NoGVNO3OptPREO2debugPessimisticNewGVNO32 x Intel Xeon E5-2620 v2OptNoSimplO3OptPREO3OptSimplO2OptSimplO3GVNO2NoGVNO2NewGVNO2OptRedO2PessimisticNewGVNO2NewGVNO333OptRedO3OptNoSimplO2PessimisticO3GVNO38001600240032004000SE +/- 1.97, N = 3SE +/- 2.08, N = 3SE +/- 3.71, N = 3SE +/- 11.59, N = 3SE +/- 4.06, N = 3SE +/- 4.48, N = 3SE +/- 9.25, N = 3SE +/- 21.66, N = 3SE +/- 29.76, N = 3SE +/- 18.17, N = 3SE +/- 4.22, N = 3SE +/- 21.71, N = 3SE +/- 7.88, N = 3SE +/- 17.74, N = 3SE +/- 14.94, N = 3SE +/- 24.20, N = 3SE +/- 30.58, N = 3SE +/- 28.74, N = 3SE +/- 31.53, N = 3SE +/- 29.21, N = 3SE +/- 4.10, N = 33868.53865.13861.63855.93855.83855.43848.03839.43837.33836.53836.43836.23835.53834.93831.93829.83824.53821.53812.53812.33805.3

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 4096GVNO3PessimisticNewGVNO3GVNO2PessimisticO3NewGVNO333PessimisticNewGVNO2OptPREO2PessimisticO2NewGVNO2OptRedO2OptRedO3NewGVNO2-debugOptPREO3OptSimplO3OptNoSimplO3debugNoGVNO3OptSimplO2OptNoSimplO22 x Intel Xeon E5-2620 v2NoGVNO28001600240032004000SE +/- 8.49, N = 3SE +/- 8.18, N = 3SE +/- 9.96, N = 3SE +/- 5.80, N = 3SE +/- 0.15, N = 3SE +/- 16.37, N = 3SE +/- 11.64, N = 3SE +/- 10.45, N = 3SE +/- 4.58, N = 3SE +/- 4.75, N = 3SE +/- 5.34, N = 3SE +/- 9.04, N = 3SE +/- 4.73, N = 3SE +/- 15.54, N = 3SE +/- 13.95, N = 3SE +/- 17.36, N = 3SE +/- 13.90, N = 3SE +/- 7.30, N = 3SE +/- 1.10, N = 3SE +/- 8.30, N = 3SE +/- 8.63, N = 33950.03949.13936.03934.63934.43932.33931.33927.13924.23923.93921.73920.93920.73919.33917.43914.83912.13911.53909.53907.03895.4

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 64PessimisticO3OptSimplO2NoGVNO2debugOptSimplO3OptPREO2OptNoSimplO3PessimisticNewGVNO2GVNO32 x Intel Xeon E5-2620 v2OptRedO3NoGVNO3NewGVNO2-debugNewGVNO2OptPREO3PessimisticO2OptRedO2NewGVNO333GVNO2PessimisticNewGVNO3OptNoSimplO211002200330044005500SE +/- 4.79, N = 3SE +/- 6.01, N = 3SE +/- 3.97, N = 3SE +/- 2.78, N = 3SE +/- 3.52, N = 3SE +/- 11.20, N = 3SE +/- 4.42, N = 3SE +/- 15.45, N = 3SE +/- 6.59, N = 3SE +/- 7.41, N = 3SE +/- 8.65, N = 3SE +/- 5.21, N = 3SE +/- 1.76, N = 3SE +/- 2.54, N = 3SE +/- 10.53, N = 3SE +/- 7.46, N = 3SE +/- 7.40, N = 3SE +/- 0.13, N = 3SE +/- 14.95, N = 3SE +/- 25.89, N = 3SE +/- 61.09, N = 34914.14911.44907.74906.84905.74902.94899.94899.64899.64899.54899.14897.94897.44894.24893.84893.44892.04891.04885.04884.44846.9

OpenBenchmarking.orgBytes, Fewer Is BetterFFTW 3.3.6Test Install SizeNewGVNO2-debug20K40K60K80K100K116112