fftw-1.2.0run

2 x Intel Xeon E5-2620 v2 testing with an ASUS Z9PE-D8 WS (5503 BIOS) motherboard and ASPEED graphics on CentOS Stream 9 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2409068-NE-FFTW120RU29&sor&grs.

Configurations compared: debug, NoGVNO2, NoGVNO3, OptNoSimplO2, 2 x Intel Xeon E5-2620 v2, OptNoSimplO3, OptSimplO2, OptSimplO3, OptRedO2, OptRedO3, OptPREO2, GVNO2, NewGVNO2, OptPREO3, NewGVNO333, GVNO3, PessimisticO3, PessimisticO2, PessimisticNewGVNO2

System Details (as reported for the first run)
- Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads)
- Motherboard: ASUS Z9PE-D8 WS (5503 BIOS)
- Chipset: Intel Xeon E7 v2/Xeon
- Memory: 32GB
- Disk: 256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00P
- Graphics: ASPEED
- Audio: Realtek ALC898
- Network: 2 x Intel 82574L
- OS: CentOS Stream 9
- Kernel: 5.14.0-467.el9.x86_64 (x86_64)
- Display Server: X Server
- Display Driver: NVIDIA
- Compiler: GCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2
- File-System: ext4
- Screen Resolution: 1024x768

Values that differ in some later runs, in the order they appear in the export (the run each belongs to is not preserved)
- Disk: 2000GB Western Digital WD20EARX-00P + 256GB Samsung SSD 850
- Monitor: ASUS VW190
- Kernel: 5.14.0-474.el9.x86_64 (x86_64)
- Kernel: 5.14.0-480.el9.x86_64 (x86_64)
- Disk: 256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00P
- Kernel: 5.14.0-496.el9.x86_64 (x86_64)
- Compiler: GCC 11.5.0 20240719 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2
- Kernel: 5.14.0-503.el9.x86_64 (x86_64)

Kernel Details
- Transparent Huge Pages: always

Environment Details
- debug: CXXFLAGS=-O2 CFLAGS=-O2
- NoGVNO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -disable-gvn=true" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -disable-gvn=true"
- NoGVNO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -disable-gvn=true" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -disable-gvn=true"
- OptNoSimplO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false"
- 2 x Intel Xeon E5-2620 v2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false"
- OptNoSimplO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false"
- OptSimplO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true"
- OptSimplO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true"
- OptRedO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false"
- OptRedO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false"
- OptPREO2: CXXFLAGS="-O2 -mllvm -enable-newgvn" CFLAGS="-O2 -mllvm -enable-newgvn"
- GVNO2: CXXFLAGS=-O2 CFLAGS=-O2
- NewGVNO2: CXXFLAGS="-O2 -mllvm -enable-newgvn" CFLAGS="-O2 -mllvm -enable-newgvn"
- OptPREO3: CXXFLAGS="-O3 -mllvm -enable-newgvn" CFLAGS="-O3 -mllvm -enable-newgvn"
- NewGVNO333: CXXFLAGS="-O3 -mllvm -enable-newgvn" CFLAGS="-O3 -mllvm -enable-newgvn"
- GVNO3: CXXFLAGS=-O3 CFLAGS=-O3
- PessimisticO3: CXXFLAGS=-O3 CFLAGS=-O3
- PessimisticO2: CXXFLAGS=-O2 CFLAGS=-O2
- PessimisticNewGVNO2: CXXFLAGS="-O2 -mllvm -enable-newgvn" CFLAGS="-O2 -mllvm -enable-newgvn"

Compiler Details
- Optimized build with assertions; Built Apr 11 2013 (07:43:48); Default target: i386-pc-linux-gnu; Host CPU: i686

Processor Details
- Scaling Governor: intel_cpufreq conservative
- CPU Microcode: 0x42e

Security Details
- gather_data_sampling: Not affected
- itlb_multihit: KVM: Mitigation of VMX disabled
- l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable
- mds: Mitigation of Clear buffers; SMT vulnerable
- meltdown: Mitigation of PTI
- mmio_stale_data: Unknown: No mitigations
- reg_file_data_sampling: Not affected
- retbleed: Not affected
- spec_rstack_overflow: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Retpolines; IBPB: conditional; IBRS_FW; STIBP: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected
- srbds: Not affected
- tsx_async_abort: Not affected
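Several run labels in the Environment Details listing carry identical flag strings. The following is a minimal sketch that groups the labels by their recorded flags to make those overlaps visible. The flag strings are copied from the listing; the helper name `group_by_flags` and the constant names are illustrative, not part of the Phoronix Test Suite. Whatever else distinguishes runs with identical flags (for example a different compiler build) is not recorded in this export.

```python
from collections import defaultdict

# Flag strings copied from the Environment Details listing. CXXFLAGS and
# CFLAGS are identical for every run, so one string per label is enough.
NEWGVN = "-mllvm -enable-newgvn"
NO_GVN = NEWGVN + " -mllvm -disable-gvn=true"
NO_SIMPL = (NEWGVN + " -mllvm -enable-phi-of-ops=false"
            " -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false")
SIMPL = (NEWGVN + " -mllvm -enable-phi-of-ops=false"
         " -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true")
RED = (NEWGVN + " -mllvm -enable-phi-of-ops=true"
       " -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false")

# Run label -> flags, in the order the labels appear in the export.
FLAGS = {
    "debug": "-O2",
    "NoGVNO2": "-O2 " + NO_GVN,
    "NoGVNO3": "-O3 " + NO_GVN,
    "OptNoSimplO2": "-O2 " + NO_SIMPL,
    "2 x Intel Xeon E5-2620 v2": "-O2 " + NO_SIMPL,
    "OptNoSimplO3": "-O3 " + NO_SIMPL,
    "OptSimplO2": "-O2 " + SIMPL,
    "OptSimplO3": "-O3 " + SIMPL,
    "OptRedO2": "-O2 " + RED,
    "OptRedO3": "-O3 " + RED,
    "OptPREO2": "-O2 " + NEWGVN,
    "GVNO2": "-O2",
    "NewGVNO2": "-O2 " + NEWGVN,
    "OptPREO3": "-O3 " + NEWGVN,
    "NewGVNO333": "-O3 " + NEWGVN,
    "GVNO3": "-O3",
    "PessimisticO3": "-O3",
    "PessimisticO2": "-O2",
    "PessimisticNewGVNO2": "-O2 " + NEWGVN,
}


def group_by_flags(flags):
    """Return {flag string: [labels using exactly that string]}."""
    groups = defaultdict(list)
    for label, value in flags.items():
        groups[value].append(label)
    return dict(groups)


if __name__ == "__main__":
    # Print only the flag strings shared by more than one run label.
    for value, labels in group_by_flags(FLAGS).items():
        if len(labels) > 1:
            print(", ".join(labels), "->", value)
```

Grouping on the exact string is deliberate: the OptRed runs list the same kind of options as the OptSimpl runs but with a different `-enable-phi-of-ops` value, so they stay in separate groups.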

fftw-1.2.0run: result overview (all values in Mflops)

Columns
 1 = Stock - 2D FFT Size 4096          17 = Float + SSE - 1D FFT Size 4096
 2 = Float + SSE - 2D FFT Size 32      18 = Stock - 1D FFT Size 128
 3 = Float + SSE - 2D FFT Size 4096    19 = Float + SSE - 1D FFT Size 512
 4 = Float + SSE - 2D FFT Size 64      20 = Stock - 2D FFT Size 512
 5 = Float + SSE - 2D FFT Size 1024    21 = Stock - 2D FFT Size 256
 6 = Float + SSE - 2D FFT Size 2048    22 = Stock - 1D FFT Size 1024
 7 = Float + SSE - 1D FFT Size 32      23 = Stock - 1D FFT Size 32
 8 = Float + SSE - 2D FFT Size 128     24 = Stock - 1D FFT Size 256
 9 = Float + SSE - 1D FFT Size 64      25 = Float + SSE - 2D FFT Size 512
10 = Float + SSE - 1D FFT Size 256     26 = Stock - 2D FFT Size 32
11 = Float + SSE - 1D FFT Size 1024    27 = Stock - 1D FFT Size 512
12 = Stock - 2D FFT Size 1024          28 = Float + SSE - 1D FFT Size 2048
13 = Float + SSE - 1D FFT Size 128     29 = Stock - 1D FFT Size 2048
14 = Float + SSE - 2D FFT Size 256     30 = Stock - 2D FFT Size 128
15 = Stock - 2D FFT Size 2048          31 = Stock - 1D FFT Size 4096
16 = Stock - 2D FFT Size 64            32 = Stock - 1D FFT Size 64

Columns 1-16
Configuration                      1        2        3        4        5        6        7        8        9       10       11       12       13       14       15       16
debug                         2751.4    16440   6092.7    14982  10890.7   6459.1   7287.0    11729   9870.6    13679    16266   3123.1    12702    11414   2706.6   4309.2
NoGVNO2                       2756.0    16548   6717.9    15878    12316   7027.0   7311.4    11743    10350    13989    16141   3091.6    13026    11399   2723.7   4336.7
NoGVNO3                       2738.9    16526   6717.1    15952    12221   7077.7   7347.4    11952    10470    14032    16059   3107.2    13206    11511   2736.0   4289.8
OptNoSimplO2                  2231.1    13471   5603.7    13502    11903   6728.2   6805.0    12188    10131    13651    15635   3053.1    12789    11094   2662.0   4337.1
2 x Intel Xeon E5-2620 v2     2239.3    13394   5618.5    13636    11906   6665.9   6925.7    12295    10229    13831    15517   3048.5    12740    11087   2642.1   4336.6
OptNoSimplO3                  2200.0    13414   5634.2    13740    11949   6731.1   6912.7    12591   9964.4    13795    15630   3018.7    12850    11137   2664.3   4354.7
OptSimplO2                    2757.9    16444   6694.6    15782    12337   7024.0   7116.0    11578    10256    14455    15962   3110.3    13245    11440   2726.6   4318.6
OptSimplO3                    2758.9    16373   6707.9    15981    12182   7072.1   7041.3    11934    10321    13983    16037   3083.8    13058    11378   2725.0   4326.3
OptRedO2                      2773.9    16668   6697.3    15989    12202   7021.6   7252.4    11609    10354    14125    16099   3123.4    13072    11356   2728.1   4320.8
OptRedO3                      2767.1    16519   6718.8    15845    12298   7059.0   7194.9    11943    10397    13997    16169   3149.1    13000    11337   2707.2   4337.3
OptPREO2                      2762.4    16344   6688.4    15388    12125   7005.5   7355.9    11788  10357.8    14247    16090   3126.0    13167    11465   2710.8   4286.4
GVNO2                         2755.0    16640   6703.2    16035    12218   6983.6   7205.1    11739   9996.3    14044    16167   3142.0    12831    11411   2716.2   4206.7
NewGVNO2                      2752.9    16344   6697.2    15906    12213   6988.0   7397.6    11796    10418    14150    16234   3138.1    12908    11408   2727.8   4316.4
OptPREO3                      2755.4    16348   6707.0    16141    12132   6992.7   7126.7    11781    10315    14266    16143   3085.7    13143    11482   2700.4   4321.4
NewGVNO333                    2745.8    16554   6732.2    16110    11990   7080.8   7316.4    11991  10279.5    14009    15939   3114.7    13219    11548   2718.0   4265.1
GVNO3                         2779.8    16311   6724.5    15842    12064   7049.2   7243.1    11795    10372    14124    16119   3120.2    13146    11467   2705.0   4347.4
PessimisticO3                 2774.8    16487   6716.1    15611    12131   7085.3   7166.1    11820    10410    13794    16223   3133.0    13019    11480   2726.8   4280.9
PessimisticO2                 2766.5    16448   6692.2    15994    12129   7027.5   7191.3    11577    10598    14365    15979   3097.4    13095    11402   2721.2   4291.0
PessimisticNewGVNO2           2758.0    16548   6718.4    15787    12242   7001.6   7420.8    11726    10529    14328    16165   3118.2    13031    11393   2731.4   4260.5

Columns 17-32
Configuration                     17       18       19       20       21       22       23       24       25       26       27       28       29       30       31       32
debug                          14091   4420.6    15552   3780.7   3660.6   4285.6   4998.5   4296.2    11870   5218.7   4320.4    15513   4055.6   3855.8   3914.8   4906.8
NoGVNO2                        14186   4414.6    15560   3773.3   3686.7   4282.9   4967.1   4308.3    11803   5211.5   4344.4    15461   4051.3   3835.5   3895.4   4907.7
NoGVNO3                        14353   4379.5    15471   3797.5   3693.3   4270.0   5020.8   4290.5    11750   5219.0   4345.7    15457   4055.0   3861.6   3912.1   4897.9
OptNoSimplO2                   14243   4358.8    15289   3679.2   3671.3   4233.7   5008.9   4309.2    11714   5127.9   4340.1    15543   4046.9   3812.5   3909.5   4846.9
2 x Intel Xeon E5-2620 v2      14259   4354.3    15270   3736.5   3634.9   4178.4   5024.6   4316.1    11715   5115.0   4285.8    15539   4046.4   3848.0   3907.0   4899.5
OptNoSimplO3                   14317   4364.2    15400   3751.1   3616.9   4198.1   4898.0   4312.8    11778   5144.6   4314.4    15551   4031.3   3839.4   3917.4   4899.9
OptSimplO2                     14337   4408.4    15414   3748.4   3642.8   4276.0   4962.7   4275.6    11780   5217.9   4334.2    15526   4030.6   3836.5   3911.5   4911.4
OptSimplO3                     14185   4417.1    15574   3759.4   3673.0   4286.3   4963.8   4247.7    11858   5213.4   4346.8    15631   4039.5   3836.4   3919.3   4905.7
OptRedO2                       14234   4415.5    15294   3780.6   3622.2   4280.9   4957.9   4288.1    11836   5189.3   4340.0    15568   4048.3   3831.9   3923.9   4892.0
OptRedO3                       14430   4415.3    15404   3784.4   3655.8   4252.9   4990.1   4329.5    11709   5220.5   4367.8    15538   4037.2   3821.5   3921.7   4899.1
OptPREO2                       14279   4402.8    15538   3775.9   3641.6   4267.9   5000.0   4314.3    11821   5144.8   4366.8    15407   4045.5   3855.9   3931.3   4902.9
GVNO2                          14363   4433.2    15704   3767.9   3589.5   4232.8   4997.4   4300.9    11841   5179.1   4298.4    15428   3989.4   3836.2   3936.0   4885.0
NewGVNO2                       14438   4405.0    15612   3769.9   3648.0   4263.2   5005.7   4308.3    11652   5213.2   4334.2    15564   4051.5   3834.9   3924.2   4894.2
OptPREO3                       13979   4427.2    15210   3785.1   3583.5   4288.4   5008.8   4241.6    11893   5209.0   4338.5    15389   4034.4   3837.3   3920.7   4893.8
NewGVNO333                     14434   4294.0    15545   3776.9   3610.3   4286.7   5021.3   4317.0    11803   5180.4   4332.9    15571   4053.4   3824.5   3934.4   4891.0
GVNO3                          14289   4327.3    15576   3761.9   3654.9   4290.6   5019.8   4313.4    11818   5176.4   4352.1    15426   4042.0   3805.3   3950.0   4899.6
PessimisticO3                  14375   4434.7    15525   3780.8   3603.6   4282.5   4979.9   4289.2    11821   5176.1   4350.4    15481   4063.8   3812.3   3934.6   4914.1
PessimisticO2                  14322   4405.5    15403   3776.0   3629.7   4265.5   5004.9   4335.8    11806   5201.1   4354.2    15338   4034.3   3865.1   3927.1   4893.4
PessimisticNewGVNO2            14295   4412.3    15380   3768.0   3663.6   4271.7   5027.4   4301.8    11693   5173.5   4319.9    15367   4046.4   3829.8   3932.3   4899.6

FFTW

Build: Stock - Size: 2D FFT Size 4096

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
GVNO3                         2779.8    19.40    3
PessimisticO3                 2774.8    23.10    3
OptRedO2                      2773.9    15.05    3
OptRedO3                      2767.1    13.13    3
PessimisticO2                 2766.5    23.64    3
OptPREO2                      2762.4    11.48    3
OptSimplO3                    2758.9     5.14    3
PessimisticNewGVNO2           2758.0     7.03    3
OptSimplO2                    2757.9     8.89    3
NoGVNO2                       2756.0    14.23    3
OptPREO3                      2755.4     7.45    3
GVNO2                         2755.0     5.66    3
NewGVNO2                      2752.9     8.68    3
debug                         2751.4     5.74    3
NewGVNO333                    2745.8    13.80    3
NoGVNO3                       2738.9    16.56    3
2 x Intel Xeon E5-2620 v2     2239.3     8.09    3
OptNoSimplO2                  2231.1    16.21    3
OptNoSimplO3                  2200.0    21.99   15
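To read a result like the Stock 2D FFT Size 4096 one above, it helps to compare the gap between two runs against their reported spread. The sketch below assumes that "SE" is the standard error of the mean over the N runs, and that the SE entries are listed in the same order as the configurations. The function names and the factor `k = 2.0` are illustrative choices, not something the Phoronix Test Suite prescribes.

```python
import math


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    n = len(samples)
    mean = sum(samples) / n
    variance = sum((x - mean) ** 2 for x in samples) / (n - 1)
    return math.sqrt(variance / n)


def relative_gap_percent(faster, slower):
    """How much higher `faster` is than `slower`, in percent of `slower`."""
    return (faster - slower) / slower * 100.0


def clearly_separated(mean_a, se_a, mean_b, se_b, k=2.0):
    """True if the means differ by more than k combined standard errors.

    The combined error adds the two standard errors in quadrature, which
    assumes the two runs are independent. k = 2.0 is an arbitrary threshold.
    """
    return abs(mean_a - mean_b) > k * math.hypot(se_a, se_b)


# (Mflops, SE) pairs taken from the Stock / 2D FFT Size 4096 result.
GVNO3 = (2779.8, 19.40)
DEBUG = (2751.4, 5.74)
OPT_NO_SIMPL_O3 = (2200.0, 21.99)

if __name__ == "__main__":
    gap = relative_gap_percent(GVNO3[0], OPT_NO_SIMPL_O3[0])
    print(f"GVNO3 vs OptNoSimplO3: {gap:.1f}% higher")
    print("separated:", clearly_separated(*GVNO3, *OPT_NO_SIMPL_O3))
    print("GVNO3 vs debug separated:", clearly_separated(*GVNO3, *DEBUG))
```

Under these assumptions the three slowest configurations are clearly separated from the rest, while the differences among the top sixteen are of the same order as their standard errors.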

FFTW

Build: Float + SSE - Size: 2D FFT Size 32

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
OptRedO2                       16668    42.15    3
GVNO2                          16640    87.96    3
NewGVNO333                     16554    81.03    3
PessimisticNewGVNO2            16548   123.12    3
NoGVNO2                        16548   136.41    3
NoGVNO3                        16526   139.56    3
OptRedO3                       16519    23.39    3
PessimisticO3                  16487    72.72    3
PessimisticO2                  16448   107.68    3
OptSimplO2                     16444    64.02    3
debug                          16440    98.77    3
OptSimplO3                     16373    92.64    3
OptPREO3                       16348    33.40    3
NewGVNO2                       16344   197.17    3
OptPREO2                       16344   142.90    3
GVNO3                          16311   194.05    3
OptNoSimplO2                   13471   131.37    3
OptNoSimplO3                   13414   102.53    3
2 x Intel Xeon E5-2620 v2      13394   107.35    3

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
NewGVNO333                    6732.2    23.42    3
GVNO3                         6724.5    23.90    3
OptRedO3                      6718.8    15.62    3
PessimisticNewGVNO2           6718.4     6.00    3
NoGVNO2                       6717.9    13.11    3
NoGVNO3                       6717.1    21.46    3
PessimisticO3                 6716.1     8.99    3
OptSimplO3                    6707.9    11.45    3
OptPREO3                      6707.0    20.27    3
GVNO2                         6703.2     5.39    3
OptRedO2                      6697.3     4.50    3
NewGVNO2                      6697.2    10.55    3
OptSimplO2                    6694.6     9.08    3
PessimisticO2                 6692.2    22.81    3
OptPREO2                      6688.4    15.10    3
debug                         6092.7   262.37    9
OptNoSimplO3                  5634.2     7.27    3
2 x Intel Xeon E5-2620 v2     5618.5     8.82    3
OptNoSimplO2                  5603.7     4.28    3

FFTW

Build: Float + SSE - Size: 2D FFT Size 64

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
OptPREO3                       16141    39.26    3
NewGVNO333                     16110    39.26    3
GVNO2                          16035    50.86    3
PessimisticO2                  15994   107.04    3
OptRedO2                       15989    23.46    3
OptSimplO3                     15981    82.53    3
NoGVNO3                        15952   133.36    3
NewGVNO2                       15906   181.67    3
NoGVNO2                        15878    70.08    3
OptRedO3                       15845    86.00    3
GVNO3                          15842   125.58   15
PessimisticNewGVNO2            15787   148.65    7
OptSimplO2                     15782   136.22   15
PessimisticO3                  15611   137.71   15
OptPREO2                       15388   159.78   15
debug                          14982    26.91    3
OptNoSimplO3                   13740    55.77    3
2 x Intel Xeon E5-2620 v2      13636    70.91    3
OptNoSimplO2                   13502    41.97    3

FFTW

Build: Float + SSE - Size: 2D FFT Size 1024

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
OptSimplO2                   12337.0    48.60    3
NoGVNO2                      12316.0    81.04    3
OptRedO3                     12298.0    15.68    3
PessimisticNewGVNO2          12242.0    81.50    3
NoGVNO3                      12221.0    76.28    3
GVNO2                        12218.0    69.17    3
NewGVNO2                     12213.0    63.84    3
OptRedO2                     12202.0    76.29    3
OptSimplO3                   12182.0    70.39    3
OptPREO3                     12132.0    74.77    3
PessimisticO3                12131.0    46.46    3
PessimisticO2                12129.0    44.50    3
OptPREO2                     12125.0    58.30    3
GVNO3                        12064.0    43.51    3
NewGVNO333                   11990.0    72.25    3
OptNoSimplO3                 11949.0    58.86    3
2 x Intel Xeon E5-2620 v2    11906.0    40.01    3
OptNoSimplO2                 11903.0   124.86    3
debug                        10890.7   644.68   12

FFTW

Build: Float + SSE - Size: 2D FFT Size 2048

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
PessimisticO3                 7085.3    10.65    3
NewGVNO333                    7080.8     2.80    3
NoGVNO3                       7077.7     8.17    3
OptSimplO3                    7072.1    14.30    3
OptRedO3                      7059.0    28.87    3
GVNO3                         7049.2     2.26    3
PessimisticO2                 7027.5    24.24    3
NoGVNO2                       7027.0    11.97    3
OptSimplO2                    7024.0    15.79    3
OptRedO2                      7021.6    19.13    3
OptPREO2                      7005.5    26.38    3
PessimisticNewGVNO2           7001.6    48.51    3
OptPREO3                      6992.7    43.40    3
NewGVNO2                      6988.0    12.08    3
GVNO2                         6983.6    18.93    3
OptNoSimplO3                  6731.1    25.37    3
OptNoSimplO2                  6728.2    33.67    3
2 x Intel Xeon E5-2620 v2     6665.9    41.03    3
debug                         6459.1    44.49    3

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
PessimisticNewGVNO2           7420.8    14.66    3
NewGVNO2                      7397.6    12.13    3
OptPREO2                      7355.9    98.33    3
NoGVNO3                       7347.4    35.65    3
NewGVNO333                    7316.4    72.31    3
NoGVNO2                       7311.4    79.17    3
debug                         7287.0     5.79    3
OptRedO2                      7252.4    76.45    5
GVNO3                         7243.1    82.98    4
GVNO2                         7205.1    65.67    7
OptRedO3                      7194.9    57.27   15
PessimisticO2                 7191.3    45.97    3
PessimisticO3                 7166.1    56.02   15
OptPREO3                      7126.7    54.90   15
OptSimplO2                    7116.0    52.52   15
OptSimplO3                    7041.3    90.54   15
2 x Intel Xeon E5-2620 v2     6925.7    64.34   15
OptNoSimplO3                  6912.7    71.46   15
OptNoSimplO2                  6805.0    24.17    3

FFTW

Build: Float + SSE - Size: 2D FFT Size 128

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
OptNoSimplO3                   12591    50.90    3
2 x Intel Xeon E5-2620 v2      12295    82.96   15
OptNoSimplO2                   12188   113.85   15
NewGVNO333                     11991    86.04    3
NoGVNO3                        11952    92.80    3
OptRedO3                       11943   104.94    8
OptSimplO3                     11934    47.40    3
PessimisticO3                  11820   101.06    3
NewGVNO2                       11796    91.43   10
GVNO3                          11795   105.79    7
OptPREO2                       11788    94.72   15
OptPREO3                       11781   147.41    3
NoGVNO2                        11743    95.52    3
GVNO2                          11739    95.45    3
debug                          11729    84.15   12
PessimisticNewGVNO2            11726    62.19    3
OptRedO2                       11609    48.56    3
OptSimplO2                     11578   111.70    3
PessimisticO2                  11577   136.55    3

FFTW

Build: Float + SSE - Size: 1D FFT Size 64

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
PessimisticO2                10598.0    17.46    3
PessimisticNewGVNO2          10529.0    31.09    3
NoGVNO3                      10470.0   114.62    3
NewGVNO2                     10418.0    85.65    3
PessimisticO3                10410.0    10.59    3
OptRedO3                     10397.0    18.11    3
GVNO3                        10372.0    40.83    3
OptPREO2                     10357.8    69.65   13
OptRedO2                     10354.0    18.19    3
NoGVNO2                      10350.0    99.10    3
OptSimplO3                   10321.0    15.38    3
OptPREO3                     10315.0     7.77    3
NewGVNO333                   10279.5   101.28    5
OptSimplO2                   10256.0   141.95    3
2 x Intel Xeon E5-2620 v2    10229.0    54.50    3
OptNoSimplO2                 10131.0    33.87    3
GVNO2                         9996.3   144.31   15
OptNoSimplO3                  9964.4    98.25    3
debug                         9870.6   172.96   15

FFTW

Build: Float + SSE - Size: 1D FFT Size 256

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
OptSimplO2                     14455   115.02    3
PessimisticO2                  14365   169.66    3
PessimisticNewGVNO2            14328   137.11    3
OptPREO3                       14266   127.87    3
OptPREO2                       14247   145.87    3
NewGVNO2                       14150    71.88    3
OptRedO2                       14125   145.93    4
GVNO3                          14124   169.36    3
GVNO2                          14044   150.68    4
NoGVNO3                        14032    28.45    3
NewGVNO333                     14009   146.24    4
OptRedO3                       13997   165.24    3
NoGVNO2                        13989    30.75    3
OptSimplO3                     13983    36.46    3
2 x Intel Xeon E5-2620 v2      13831    67.57    3
OptNoSimplO3                   13795    58.29    3
PessimisticO3                  13794   101.06    3
debug                          13679    98.34    3
OptNoSimplO2                   13651    68.09    3

FFTW

Build: Float + SSE - Size: 1D FFT Size 1024

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
debug                          16266    80.85    3
NewGVNO2                       16234    75.24    3
PessimisticO3                  16223    78.81    3
OptRedO3                       16169    94.45    3
GVNO2                          16167    78.73    3
PessimisticNewGVNO2            16165    64.06    3
OptPREO3                       16143    21.01    3
NoGVNO2                        16141    67.72    3
GVNO3                          16119    15.17    3
OptRedO2                       16099    85.56    3
OptPREO2                       16090    82.71    3
NoGVNO3                        16059    80.68    3
OptSimplO3                     16037   141.62    3
PessimisticO2                  15979   135.76    3
OptSimplO2                     15962    73.63    3
NewGVNO333                     15939   129.47    3
OptNoSimplO2                   15635    43.66    3
OptNoSimplO3                   15630    32.22    3
2 x Intel Xeon E5-2620 v2      15517   112.78    3

FFTW

Build: Stock - Size: 2D FFT Size 1024

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
OptRedO3                      3149.1     9.39    3
GVNO2                         3142.0     5.57    3
NewGVNO2                      3138.1    18.27    3
PessimisticO3                 3133.0     1.11    3
OptPREO2                      3126.0     8.38    3
OptRedO2                      3123.4    14.89    3
debug                         3123.1    15.08    3
GVNO3                         3120.2    12.60    3
PessimisticNewGVNO2           3118.2    24.52    3
NewGVNO333                    3114.7    24.45    3
OptSimplO2                    3110.3    26.70    3
NoGVNO3                       3107.2    38.36    3
PessimisticO2                 3097.4    31.08    3
NoGVNO2                       3091.6    13.41    3
OptPREO3                      3085.7    15.98    3
OptSimplO3                    3083.8    10.08    3
OptNoSimplO2                  3053.1    12.20    3
2 x Intel Xeon E5-2620 v2     3048.5    13.57    3
OptNoSimplO3                  3018.7     8.28    3

FFTW

Build: Float + SSE - Size: 1D FFT Size 128

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
OptSimplO2                     13245    45.94    3
NewGVNO333                     13219    46.51    3
NoGVNO3                        13206    33.80    3
OptPREO2                       13167   132.00    5
GVNO3                          13146    52.35    3
OptPREO3                       13143   120.91    3
PessimisticO2                  13095   165.41    3
OptRedO2                       13072   133.00    3
OptSimplO3                     13058   130.44    3
PessimisticNewGVNO2            13031    93.99    3
NoGVNO2                        13026   160.95    3
PessimisticO3                  13019    90.08    3
OptRedO3                       13000    67.87    3
NewGVNO2                       12908   169.95    3
OptNoSimplO3                   12850    80.49    3
GVNO2                          12831    79.22    3
OptNoSimplO2                   12789    84.00    3
2 x Intel Xeon E5-2620 v2      12740   136.77    3
debug                          12702   136.03   15

FFTW

Build: Float + SSE - Size: 2D FFT Size 256

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
NewGVNO333                     11548    65.80    3
NoGVNO3                        11511    40.95    3
OptPREO3                       11482    60.95    3
PessimisticO3                  11480    62.20    3
GVNO3                          11467    34.04    3
OptPREO2                       11465    55.86    3
OptSimplO2                     11440    35.14    3
debug                          11414    86.71    3
GVNO2                          11411    40.76    3
NewGVNO2                       11408    84.51    3
PessimisticO2                  11402    73.06    3
NoGVNO2                        11399    85.30    3
PessimisticNewGVNO2            11393    68.55    3
OptSimplO3                     11378    91.42    3
OptRedO2                       11356   108.39    3
OptRedO3                       11337   140.84    3
OptNoSimplO3                   11137    21.39    3
OptNoSimplO2                   11094   115.75    3
2 x Intel Xeon E5-2620 v2      11087   119.21    3

FFTW

Build: Stock - Size: 2D FFT Size 2048

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
NoGVNO3                       2736.0    12.56    3
PessimisticNewGVNO2           2731.4    12.84    3
OptRedO2                      2728.1    16.99    3
NewGVNO2                      2727.8    19.22    3
PessimisticO3                 2726.8     5.76    3
OptSimplO2                    2726.6    17.93    3
OptSimplO3                    2725.0    16.84    3
NoGVNO2                       2723.7     8.21    3
PessimisticO2                 2721.2    14.64    3
NewGVNO333                    2718.0     6.49    3
GVNO2                         2716.2    13.80    3
OptPREO2                      2710.8    13.94    3
OptRedO3                      2707.2    12.30    3
debug                         2706.6     3.24    3
GVNO3                         2705.0    18.48    3
OptPREO3                      2700.4    11.61    3
OptNoSimplO3                  2664.3     6.56    3
OptNoSimplO2                  2662.0     6.15    3
2 x Intel Xeon E5-2620 v2     2642.1     5.49    3

FFTW

Build: Stock - Size: 2D FFT Size 64

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
OptNoSimplO3                  4354.7     7.74    3
GVNO3                         4347.4     2.62    3
OptRedO3                      4337.3    18.00    3
OptNoSimplO2                  4337.1     4.58    3
NoGVNO2                       4336.7     4.40    3
2 x Intel Xeon E5-2620 v2     4336.6     5.04    3
OptSimplO3                    4326.3    21.71    3
OptPREO3                      4321.4     3.34    3
OptRedO2                      4320.8    22.79    3
OptSimplO2                    4318.6     4.27    3
NewGVNO2                      4316.4    22.90    3
debug                         4309.2    21.54    3
PessimisticO2                 4291.0    52.12    4
NoGVNO3                       4289.8    46.52    4
OptPREO2                      4286.4    61.56    3
PessimisticO3                 4280.9    60.71    3
NewGVNO333                    4265.1    60.33    3
PessimisticNewGVNO2           4260.5    56.44    3
GVNO2                         4206.7    51.49    4

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
NewGVNO2                       14438    56.04    3
NewGVNO333                     14434    25.71    3
OptRedO3                       14430    94.63    3
PessimisticO3                  14375    33.27    3
GVNO2                          14363    46.84    3
NoGVNO3                        14353   135.86    3
OptSimplO2                     14337   123.09    3
PessimisticO2                  14322     9.50    3
OptNoSimplO3                   14317    55.79    3
PessimisticNewGVNO2            14295   111.29    3
GVNO3                          14289    41.07    3
OptPREO2                       14279    55.00    3
2 x Intel Xeon E5-2620 v2      14259    82.03    3
OptNoSimplO2                   14243    19.46    3
OptRedO2                       14234    62.13    3
NoGVNO2                        14186    80.21    3
OptSimplO3                     14185    94.10    3
debug                          14091   193.73    3
OptPREO3                       13979    89.82    3

FFTW

Build: Stock - Size: 1D FFT Size 128

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
PessimisticO3                 4434.7     1.58    3
GVNO2                         4433.2     1.52    3
OptPREO3                      4427.2     9.28    3
debug                         4420.6     6.43    3
OptSimplO3                    4417.1     2.14    3
OptRedO2                      4415.5     4.33    3
OptRedO3                      4415.3     2.25    3
NoGVNO2                       4414.6     2.46    3
PessimisticNewGVNO2           4412.3     6.19    3
OptSimplO2                    4408.4     2.45    3
PessimisticO2                 4405.5     3.05    3
NewGVNO2                      4405.0     5.65    3
OptPREO2                      4402.8     4.02    3
NoGVNO3                       4379.5    35.10    9
OptNoSimplO3                  4364.2     2.94    3
OptNoSimplO2                  4358.8     1.43    3
2 x Intel Xeon E5-2620 v2     4354.3     3.10    3
GVNO3                         4327.3    52.07   15
NewGVNO333                    4294.0    51.58   15

FFTW

Build: Float + SSE - Size: 1D FFT Size 512

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
GVNO2                          15704    89.29    3
NewGVNO2                       15612    71.36    3
GVNO3                          15576   109.69    3
OptSimplO3                     15574    63.39    3
NoGVNO2                        15560    81.91    3
debug                          15552   123.58    3
NewGVNO333                     15545   163.42    3
OptPREO2                       15538   127.81    3
PessimisticO3                  15525   168.49    3
NoGVNO3                        15471   120.34    3
OptSimplO2                     15414    60.73    3
OptRedO3                       15404   104.45    3
PessimisticO2                  15403    89.79    3
OptNoSimplO3                   15400    66.70    3
PessimisticNewGVNO2            15380   109.70    3
OptRedO2                       15294   195.78    3
OptNoSimplO2                   15289   157.18    3
2 x Intel Xeon E5-2620 v2      15270   127.67    3
OptPREO3                       15210    45.83    3

FFTW

Build: Stock - Size: 2D FFT Size 512

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
NoGVNO3                       3797.5    10.93    3
OptPREO3                      3785.1     3.13    3
OptRedO3                      3784.4     7.10    3
PessimisticO3                 3780.8    21.49    3
debug                         3780.7    10.97    3
OptRedO2                      3780.6     6.77    3
NewGVNO333                    3776.9    14.05    3
PessimisticO2                 3776.0    11.78    3
OptPREO2                      3775.9    16.55    3
NoGVNO2                       3773.3    13.13    3
NewGVNO2                      3769.9    20.08    3
PessimisticNewGVNO2           3768.0    14.01    3
GVNO2                         3767.9     5.58    3
GVNO3                         3761.9     6.57    3
OptSimplO3                    3759.4     9.69    3
OptNoSimplO3                  3751.1    14.36    3
OptSimplO2                    3748.4    34.42   12
2 x Intel Xeon E5-2620 v2     3736.5    41.40    5
OptNoSimplO2                  3679.2    38.03    5

FFTW

Build: Stock - Size: 2D FFT Size 256

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
NoGVNO3                       3693.3     7.11    3
NoGVNO2                       3686.7    13.35    3
OptSimplO3                    3673.0    34.82    3
OptNoSimplO2                  3671.3    17.09    3
PessimisticNewGVNO2           3663.6     4.70    3
debug                         3660.6    23.34    3
OptRedO3                      3655.8    33.79    3
GVNO3                         3654.9    12.33    3
NewGVNO2                      3648.0    33.28    3
OptSimplO2                    3642.8    34.93    3
OptPREO2                      3641.6    12.43    3
2 x Intel Xeon E5-2620 v2     3634.9    23.89    3
PessimisticO2                 3629.7    20.14    3
OptRedO2                      3622.2    16.18    3
OptNoSimplO3                  3616.9    37.88    3
NewGVNO333                    3610.3    23.72    3
PessimisticO3                 3603.6    50.06    3
GVNO2                         3589.5    47.63    3
OptPREO3                      3583.5    35.49    3

FFTW

Build: Stock - Size: 1D FFT Size 1024

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
GVNO3                         4290.6     8.12    3
OptPREO3                      4288.4     4.55    3
NewGVNO333                    4286.7     7.01    3
OptSimplO3                    4286.3     4.50    3
debug                         4285.6     5.54    3
NoGVNO2                       4282.9     2.87    3
PessimisticO3                 4282.5    10.52    3
OptRedO2                      4280.9     4.77    3
OptSimplO2                    4276.0    11.99    3
PessimisticNewGVNO2           4271.7     6.86    3
NoGVNO3                       4270.0    19.30    3
OptPREO2                      4267.9    14.59    3
PessimisticO2                 4265.5    26.82    3
NewGVNO2                      4263.2     9.92    3
OptRedO3                      4252.9    32.58    3
OptNoSimplO2                  4233.7     8.01    3
GVNO2                         4232.8    48.65    4
OptNoSimplO3                  4198.1    25.83    3
2 x Intel Xeon E5-2620 v2     4178.4    59.35    3

FFTW

Build: Stock - Size: 1D FFT Size 32

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
PessimisticNewGVNO2           5027.4     1.59    3
2 x Intel Xeon E5-2620 v2     5024.6     4.74    3
NewGVNO333                    5021.3     2.77    3
NoGVNO3                       5020.8    23.10    3
GVNO3                         5019.8    26.35    3
OptNoSimplO2                  5008.9    17.04    3
OptPREO3                      5008.8    18.68    3
NewGVNO2                      5005.7    31.02    3
PessimisticO2                 5004.9    24.70    3
OptPREO2                      5000.0    15.07    3
debug                         4998.5    29.89    3
GVNO2                         4997.4    13.08    3
OptRedO3                      4990.1    13.97    3
PessimisticO3                 4979.9    15.16    3
NoGVNO2                       4967.1     8.35    3
OptSimplO3                    4963.8    39.97    3
OptSimplO2                    4962.7    25.51    3
OptRedO2                      4957.9    31.02    3
OptNoSimplO3                  4898.0    95.20   13

FFTW

Build: Stock - Size: 1D FFT Size 256

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
PessimisticO2                 4335.8    12.05    3
OptRedO3                      4329.5    17.30    3
NewGVNO333                    4317.0    17.44    3
2 x Intel Xeon E5-2620 v2     4316.1    27.37    3
OptPREO2                      4314.3     3.18    3
GVNO3                         4313.4    19.87    3
OptNoSimplO3                  4312.8    24.49    3
OptNoSimplO2                  4309.2     1.60    3
NewGVNO2                      4308.3     5.17    3
NoGVNO2                       4308.3     1.57    3
PessimisticNewGVNO2           4301.8     2.74    3
GVNO2                         4300.9    24.27    3
debug                         4296.2     1.72    3
NoGVNO3                       4290.5    15.53    3
PessimisticO3                 4289.2    10.82    3
OptRedO2                      4288.1    22.07    3
OptSimplO2                    4275.6     8.48    3
OptSimplO3                    4247.7    53.04    3
OptPREO3                      4241.6    43.31    3

FFTW

Build: Float + SSE - Size: 2D FFT Size 512

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
OptPREO3                       11893    57.89    3
debug                          11870    97.11    3
OptSimplO3                     11858    36.67    3
GVNO2                          11841    40.29    3
OptRedO2                       11836    93.72    3
PessimisticO3                  11821    60.34    3
OptPREO2                       11821    83.95    3
GVNO3                          11818    52.21    3
PessimisticO2                  11806    74.64    3
NewGVNO333                     11803    41.82    3
NoGVNO2                        11803    84.01    3
OptSimplO2                     11780   128.25    3
OptNoSimplO3                   11778    61.50    3
NoGVNO3                        11750    19.97    3
2 x Intel Xeon E5-2620 v2      11715    37.37    3
OptNoSimplO2                   11714    64.54    3
OptRedO3                       11709    39.68    3
PessimisticNewGVNO2            11693   106.71    3
NewGVNO2                       11652    64.51    3

FFTW

Build: Stock - Size: 2D FFT Size 32

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
OptRedO3                      5220.5    11.16    3
NoGVNO3                       5219.0     6.35    3
debug                         5218.7     7.51    3
OptSimplO2                    5217.9     6.97    3
OptSimplO3                    5213.4    12.54    3
NewGVNO2                      5213.2     1.29    3
NoGVNO2                       5211.5    12.52    3
OptPREO3                      5209.0     5.69    3
PessimisticO2                 5201.1    11.17    3
OptRedO2                      5189.3    32.60    3
NewGVNO333                    5180.4    11.03    3
GVNO2                         5179.1    33.03    3
GVNO3                         5176.4    39.92    3
PessimisticO3                 5176.1    36.57    3
PessimisticNewGVNO2           5173.5    33.93    3
OptPREO2                      5144.8    55.21    3
OptNoSimplO3                  5144.6    41.81    3
OptNoSimplO2                  5127.9    43.81    3
2 x Intel Xeon E5-2620 v2     5115.0     6.03    3

FFTW

Build: Stock - Size: 1D FFT Size 512

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
OptRedO3                      4367.8    10.85    3
OptPREO2                      4366.8     8.76    3
PessimisticO2                 4354.2    18.08    3
GVNO3                         4352.1    15.02    3
PessimisticO3                 4350.4    16.20    3
OptSimplO3                    4346.8     4.56    3
NoGVNO3                       4345.7    15.04    3
NoGVNO2                       4344.4    12.50    3
OptNoSimplO2                  4340.1     2.84    3
OptRedO2                      4340.0     3.08    3
OptPREO3                      4338.5    13.39    3
NewGVNO2                      4334.2     7.37    3
OptSimplO2                    4334.2    27.65    3
NewGVNO333                    4332.9    10.16    3
debug                         4320.4    16.10    3
PessimisticNewGVNO2           4319.9    32.02    3
OptNoSimplO3                  4314.4    11.47    3
GVNO2                         4298.4    23.58    3
2 x Intel Xeon E5-2620 v2     4285.8    22.16    3

FFTW

Build: Float + SSE - Size: 1D FFT Size 2048

Mflops, More Is Better (FFTW 3.3.6)

Configuration                 Mflops   SE +/-    N
OptSimplO3                     15631    27.28    3
NewGVNO333                     15571    20.01    3
OptRedO2                       15568    58.95    3
NewGVNO2                       15564    42.13    3
OptNoSimplO3                   15551    15.63    3
OptNoSimplO2                   15543    24.95    3
2 x Intel Xeon E5-2620 v2      15539   107.49    3
OptRedO3                       15538    33.86    3
OptSimplO2                     15526    37.49    3
debug                          15513    41.31    3
PessimisticO3                  15481    97.36    3
NoGVNO2                        15461    75.96    3
NoGVNO3                        15457    78.24    3
GVNO2                          15428    20.00    3
GVNO3                          15426    76.47    3
OptPREO2                       15407    53.18    3
OptPREO3                       15389    16.00    3
PessimisticNewGVNO2            15367   143.99    3
PessimisticO2                  15338   130.85    3

FFTW

Build: Stock - Size: 1D FFT Size 2048

FFTW 3.3.6 - Build: Stock - Size: 1D FFT Size 2048 (Mflops, More Is Better; OpenBenchmarking.org)

PessimisticO3: 4063.8 (SE +/- 9.60, N = 3)
debug: 4055.6 (SE +/- 6.96, N = 3)
NoGVNO3: 4055.0 (SE +/- 8.12, N = 3)
NewGVNO333: 4053.4 (SE +/- 8.15, N = 3)
NewGVNO2: 4051.5 (SE +/- 12.48, N = 3)
NoGVNO2: 4051.3 (SE +/- 6.84, N = 3)
OptRedO2: 4048.3 (SE +/- 14.17, N = 3)
OptNoSimplO2: 4046.9 (SE +/- 18.65, N = 3)
PessimisticNewGVNO2: 4046.4 (SE +/- 4.42, N = 3)
2 x Intel Xeon E5-2620 v2: 4046.4 (SE +/- 11.79, N = 3)
OptPREO2: 4045.5 (SE +/- 4.44, N = 3)
GVNO3: 4042.0 (SE +/- 14.25, N = 3)
OptSimplO3: 4039.5 (SE +/- 7.07, N = 3)
OptRedO3: 4037.2 (SE +/- 22.16, N = 3)
OptPREO3: 4034.4 (SE +/- 16.83, N = 3)
PessimisticO2: 4034.3 (SE +/- 6.35, N = 3)
OptNoSimplO3: 4031.3 (SE +/- 6.58, N = 3)
OptSimplO2: 4030.6 (SE +/- 14.04, N = 3)
GVNO2: 3989.4 (SE +/- 47.88, N = 4)

FFTW

Build: Stock - Size: 2D FFT Size 128

FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 128 (Mflops, More Is Better; OpenBenchmarking.org)

PessimisticO2: 3865.1 (SE +/- 2.08, N = 3)
NoGVNO3: 3861.6 (SE +/- 3.71, N = 3)
OptPREO2: 3855.9 (SE +/- 11.59, N = 3)
debug: 3855.8 (SE +/- 4.06, N = 3)
2 x Intel Xeon E5-2620 v2: 3848.0 (SE +/- 9.25, N = 3)
OptNoSimplO3: 3839.4 (SE +/- 21.66, N = 3)
OptPREO3: 3837.3 (SE +/- 29.76, N = 3)
OptSimplO2: 3836.5 (SE +/- 18.17, N = 3)
OptSimplO3: 3836.4 (SE +/- 4.22, N = 3)
GVNO2: 3836.2 (SE +/- 21.71, N = 3)
NoGVNO2: 3835.5 (SE +/- 7.88, N = 3)
NewGVNO2: 3834.9 (SE +/- 17.74, N = 3)
OptRedO2: 3831.9 (SE +/- 14.94, N = 3)
PessimisticNewGVNO2: 3829.8 (SE +/- 24.20, N = 3)
NewGVNO333: 3824.5 (SE +/- 30.58, N = 3)
OptRedO3: 3821.5 (SE +/- 28.74, N = 3)
OptNoSimplO2: 3812.5 (SE +/- 31.53, N = 3)
PessimisticO3: 3812.3 (SE +/- 29.21, N = 3)
GVNO3: 3805.3 (SE +/- 4.10, N = 3)

FFTW

Build: Stock - Size: 1D FFT Size 4096

FFTW 3.3.6 - Build: Stock - Size: 1D FFT Size 4096 (Mflops, More Is Better; OpenBenchmarking.org)

GVNO3: 3950.0 (SE +/- 8.49, N = 3)
GVNO2: 3936.0 (SE +/- 9.96, N = 3)
PessimisticO3: 3934.6 (SE +/- 5.80, N = 3)
NewGVNO333: 3934.4 (SE +/- 0.15, N = 3)
PessimisticNewGVNO2: 3932.3 (SE +/- 16.37, N = 3)
OptPREO2: 3931.3 (SE +/- 11.64, N = 3)
PessimisticO2: 3927.1 (SE +/- 10.45, N = 3)
NewGVNO2: 3924.2 (SE +/- 4.58, N = 3)
OptRedO2: 3923.9 (SE +/- 4.75, N = 3)
OptRedO3: 3921.7 (SE +/- 5.34, N = 3)
OptPREO3: 3920.7 (SE +/- 4.73, N = 3)
OptSimplO3: 3919.3 (SE +/- 15.54, N = 3)
OptNoSimplO3: 3917.4 (SE +/- 13.95, N = 3)
debug: 3914.8 (SE +/- 17.36, N = 3)
NoGVNO3: 3912.1 (SE +/- 13.90, N = 3)
OptSimplO2: 3911.5 (SE +/- 7.30, N = 3)
OptNoSimplO2: 3909.5 (SE +/- 1.10, N = 3)
2 x Intel Xeon E5-2620 v2: 3907.0 (SE +/- 8.30, N = 3)
NoGVNO2: 3895.4 (SE +/- 8.63, N = 3)

FFTW

Build: Stock - Size: 1D FFT Size 64

FFTW 3.3.6 - Build: Stock - Size: 1D FFT Size 64 (Mflops, More Is Better; OpenBenchmarking.org)

PessimisticO3: 4914.1 (SE +/- 4.79, N = 3)
OptSimplO2: 4911.4 (SE +/- 6.01, N = 3)
NoGVNO2: 4907.7 (SE +/- 3.97, N = 3)
debug: 4906.8 (SE +/- 2.78, N = 3)
OptSimplO3: 4905.7 (SE +/- 3.52, N = 3)
OptPREO2: 4902.9 (SE +/- 11.20, N = 3)
OptNoSimplO3: 4899.9 (SE +/- 4.42, N = 3)
PessimisticNewGVNO2: 4899.6 (SE +/- 15.45, N = 3)
GVNO3: 4899.6 (SE +/- 6.59, N = 3)
2 x Intel Xeon E5-2620 v2: 4899.5 (SE +/- 7.41, N = 3)
OptRedO3: 4899.1 (SE +/- 8.65, N = 3)
NoGVNO3: 4897.9 (SE +/- 5.21, N = 3)
NewGVNO2: 4894.2 (SE +/- 2.54, N = 3)
OptPREO3: 4893.8 (SE +/- 10.53, N = 3)
PessimisticO2: 4893.4 (SE +/- 7.46, N = 3)
OptRedO2: 4892.0 (SE +/- 7.40, N = 3)
NewGVNO333: 4891.0 (SE +/- 0.13, N = 3)
GVNO2: 4885.0 (SE +/- 14.95, N = 3)
OptNoSimplO2: 4846.9 (SE +/- 61.09, N = 3)


Phoronix Test Suite v10.8.5