fftw-1.2.0run

2 x Intel Xeon E5-2620 v2 testing with a ASUS Z9PE-D8 WS (5503 BIOS) and ASPEED on CentOS Stream 9 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2409068-NE-FFTW120RU29&grr&sro.

fftw-1.2.0runProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkMonitorOSKernelDisplay ServerDisplay DriverCompilerFile-SystemScreen ResolutiondebugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v2OptNoSimplO3OptSimplO2OptSimplO3OptRedO2OptRedO3OptPREO2GVNO2NewGVNO2OptPREO3NewGVNO333GVNO3PessimisticO3PessimisticO2PessimisticNewGVNO22 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads)ASUS Z9PE-D8 WS (5503 BIOS)Intel Xeon E7 v2/Xeon32GB256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00PASPEEDRealtek ALC8982 x Intel 82574LCentOS Stream 95.14.0-467.el9.x86_64 (x86_64)X ServerNVIDIAGCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2ext41024x7682000GB Western Digital WD20EARX-00P + 256GB Samsung SSD 850ASUS VW1905.14.0-474.el9.x86_64 (x86_64)5.14.0-480.el9.x86_64 (x86_64)256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00P5.14.0-496.el9.x86_64 (x86_64)GCC 11.5.0 20240719 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.25.14.0-503.el9.x86_64 (x86_64)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: alwaysEnvironment Details- debug: CXXFLAGS=-O2 CFLAGS=-O2- NoGVNO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -disable-gvn=true" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -disable-gvn=true"- NoGVNO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -disable-gvn=true" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -disable-gvn=true"- OptNoSimplO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false"- 2 x Intel Xeon E5-2620 v2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false"- OptNoSimplO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false"- OptSimplO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true"- OptSimplO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true"- OptRedO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false"- OptRedO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false"- OptPREO2: CXXFLAGS="-O2 -mllvm -enable-newgvn" CFLAGS="-O2 -mllvm -enable-newgvn"- GVNO2: CXXFLAGS=-O2 CFLAGS=-O2- NewGVNO2: CXXFLAGS="-O2 -mllvm -enable-newgvn" CFLAGS="-O2 -mllvm -enable-newgvn"- OptPREO3: CXXFLAGS="-O3 -mllvm -enable-newgvn" CFLAGS="-O3 -mllvm -enable-newgvn"- NewGVNO333: CXXFLAGS="-O3 -mllvm -enable-newgvn" CFLAGS="-O3 -mllvm -enable-newgvn"- GVNO3: CXXFLAGS=-O3 CFLAGS=-O3- PessimisticO3: CXXFLAGS=-O3 CFLAGS=-O3- PessimisticO2: CXXFLAGS=-O2 CFLAGS=-O2- PessimisticNewGVNO2: CXXFLAGS="-O2 -mllvm -enable-newgvn" CFLAGS="-O2 -mllvm -enable-newgvn"Compiler Details- Optimized build with assertions; Built Apr 11 2013 (07:43:48); Default target: i386-pc-linux-gnu; Host CPU: i686Processor Details- Scaling Governor: intel_cpufreq conservative - CPU Microcode: 0x42eSecurity Details- gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Unknown: No mitigations + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; IBRS_FW; STIBP: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

fftw-1.2.0runfftw: Float + SSE - 2D FFT Size 4096fftw: Stock - 2D FFT Size 4096fftw: Float + SSE - 2D FFT Size 2048fftw: Stock - 2D FFT Size 2048fftw: Float + SSE - 2D FFT Size 1024fftw: Stock - 2D FFT Size 1024fftw: Float + SSE - 1D FFT Size 32fftw: Float + SSE - 1D FFT Size 4096fftw: Float + SSE - 2D FFT Size 64fftw: Float + SSE - 2D FFT Size 512fftw: Float + SSE - 2D FFT Size 128fftw: Stock - 1D FFT Size 4096fftw: Stock - 2D FFT Size 512fftw: Float + SSE - 1D FFT Size 2048fftw: Stock - 1D FFT Size 128fftw: Float + SSE - 2D FFT Size 256fftw: Stock - 1D FFT Size 2048fftw: Stock - 2D FFT Size 128fftw: Float + SSE - 1D FFT Size 1024fftw: Float + SSE - 1D FFT Size 64fftw: Stock - 1D FFT Size 256fftw: Stock - 1D FFT Size 1024fftw: Stock - 2D FFT Size 256fftw: Float + SSE - 1D FFT Size 512fftw: Stock - 2D FFT Size 64fftw: Stock - 1D FFT Size 32fftw: Float + SSE - 1D FFT Size 128fftw: Float + SSE - 1D FFT Size 256fftw: Stock - 1D FFT Size 512fftw: Float + SSE - 2D FFT Size 32fftw: Stock - 1D FFT Size 64fftw: Stock - 2D FFT Size 32debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v2OptNoSimplO3OptSimplO2OptSimplO3OptRedO2OptRedO3OptPREO2GVNO2NewGVNO2OptPREO3NewGVNO333GVNO3PessimisticO3PessimisticO2PessimisticNewGVNO26092.72751.46459.12706.610890.73123.17287.0140911498211870117293914.83780.7155134420.6114144055.63855.8162669870.64296.24285.63660.6155524309.24998.512702136794320.4164404906.85218.76717.92756.07027.02723.7123163091.67311.4141861587811803117433895.43773.3154614414.6113994051.33835.516141103504308.34282.93686.7155604336.74967.113026139894344.4165484907.75211.56717.12738.97077.72736.0122213107.27347.4143531595211750119523912.13797.5154574379.5115114055.03861.616059104704290.54270.03693.3154714289.85020.813206140324345.7165264897.95219.05603.72231.16728.22662.0119033053.16805.0142431350211714121883909.53679.2155434358.8110944046.93812.515635101314309.24233.73671.3152894337.15008.912789136514340.1134714846.95127.95618.52239.36665.92642.1119063048.56925.7142591363611715122953907.03736.5155394354.3110874046.43848.015517102294316.14178.43634.9152704336.65024.612740138314285.8133944899.55115.05634.22200.06731.12664.3119493018.76912.7143171374011778125913917.43751.1155514364.2111374031.33839.4156309964.44312.84198.13616.9154004354.74898.012850137954314.4134144899.95144.66694.62757.97024.02726.6123373110.37116.0143371578211780115783911.53748.4155264408.4114404030.63836.515962102564275.64276.03642.8154144318.64962.713245144554334.2164444911.45217.96707.92758.97072.12725.0121823083.87041.3141851598111858119343919.33759.4156314417.1113784039.53836.416037103214247.74286.33673.0155744326.34963.813058139834346.8163734905.75213.46697.32773.97021.62728.1122023123.47252.4142341598911836116093923.93780.6155684415.5113564048.33831.916099103544288.14280.93622.2152944320.84957.913072141254340.0166684892.05189.36718.82767.17059.02707.2122983149.17194.9144301584511709119433921.73784.4155384415.3113374037.23821.516169103974329.54252.93655.8154044337.34990.113000139974367.8165194899.15220.56688.42762.47005.52710.8121253126.07355.9142791538811821117883931.33775.9154074402.8114654045.53855.91609010357.84314.34267.93641.6155384286.45000.013167142474366.8163444902.95144.86703.22755.06983.62716.2122183142.07205.1143631603511841117393936.03767.9154284433.2114113989.43836.2161679996.34300.94232.83589.5157044206.74997.412831140444298.4166404885.05179.16697.22752.96988.02727.8122133138.17397.6144381590611652117963924.23769.9155644405.0114084051.53834.916234104184308.34263.23648.0156124316.45005.712908141504334.2163444894.25213.26707.02755.46992.72700.4121323085.77126.7139791614111893117813920.73785.1153894427.2114824034.43837.316143103154241.64288.43583.5152104321.45008.813143142664338.5163484893.85209.06732.22745.87080.82718.0119903114.77316.4144341611011803119913934.43776.9155714294.0115484053.43824.51593910279.54317.04286.73610.3155454265.15021.313219140094332.9165544891.05180.46724.52779.87049.22705.0120643120.27243.1142891584211818117953950.03761.9154264327.3114674042.03805.316119103724313.44290.63654.9155764347.45019.813146141244352.1163114899.65176.46716.12774.87085.32726.8121313133.07166.1143751561111821118203934.63780.8154814434.7114804063.83812.316223104104289.24282.53603.6155254280.94979.913019137944350.4164874914.15176.16692.22766.57027.52721.2121293097.47191.3143221599411806115773927.13776.0153384405.5114024034.33865.115979105984335.84265.53629.7154034291.05004.913095143654354.2164484893.45201.16718.42758.07001.62731.4122423118.27420.8142951578711693117263932.33768.0153674412.3113934046.43829.816165105294301.84271.73663.6153804260.55027.413031143284319.9165484899.65173.5OpenBenchmarking.org

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 40962 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug14002800420056007000SE +/- 8.82, N = 3SE +/- 5.39, N = 3SE +/- 23.90, N = 3SE +/- 10.55, N = 3SE +/- 23.42, N = 3SE +/- 13.11, N = 3SE +/- 21.46, N = 3SE +/- 4.28, N = 3SE +/- 7.27, N = 3SE +/- 15.10, N = 3SE +/- 20.27, N = 3SE +/- 4.50, N = 3SE +/- 15.62, N = 3SE +/- 9.08, N = 3SE +/- 11.45, N = 3SE +/- 6.00, N = 3SE +/- 22.81, N = 3SE +/- 8.99, N = 3SE +/- 262.37, N = 95618.56703.26724.56697.26732.26717.96717.15603.75634.26688.46707.06697.36718.86694.66707.96718.46692.26716.16092.7

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 40962 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug6001200180024003000SE +/- 8.09, N = 3SE +/- 5.66, N = 3SE +/- 19.40, N = 3SE +/- 8.68, N = 3SE +/- 13.80, N = 3SE +/- 14.23, N = 3SE +/- 16.56, N = 3SE +/- 16.21, N = 3SE +/- 21.99, N = 15SE +/- 11.48, N = 3SE +/- 7.45, N = 3SE +/- 15.05, N = 3SE +/- 13.13, N = 3SE +/- 8.89, N = 3SE +/- 5.14, N = 3SE +/- 7.03, N = 3SE +/- 23.64, N = 3SE +/- 23.10, N = 3SE +/- 5.74, N = 32239.32755.02779.82752.92745.82756.02738.92231.12200.02762.42755.42773.92767.12757.92758.92758.02766.52774.82751.4

FFTW

Build: Float + SSE - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 20482 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug15003000450060007500SE +/- 41.03, N = 3SE +/- 18.93, N = 3SE +/- 2.26, N = 3SE +/- 12.08, N = 3SE +/- 2.80, N = 3SE +/- 11.97, N = 3SE +/- 8.17, N = 3SE +/- 33.67, N = 3SE +/- 25.37, N = 3SE +/- 26.38, N = 3SE +/- 43.40, N = 3SE +/- 19.13, N = 3SE +/- 28.87, N = 3SE +/- 15.79, N = 3SE +/- 14.30, N = 3SE +/- 48.51, N = 3SE +/- 24.24, N = 3SE +/- 10.65, N = 3SE +/- 44.49, N = 36665.96983.67049.26988.07080.87027.07077.76728.26731.17005.56992.77021.67059.07024.07072.17001.67027.57085.36459.1

FFTW

Build: Stock - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 20482 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug6001200180024003000SE +/- 5.49, N = 3SE +/- 13.80, N = 3SE +/- 18.48, N = 3SE +/- 19.22, N = 3SE +/- 6.49, N = 3SE +/- 8.21, N = 3SE +/- 12.56, N = 3SE +/- 6.15, N = 3SE +/- 6.56, N = 3SE +/- 13.94, N = 3SE +/- 11.61, N = 3SE +/- 16.99, N = 3SE +/- 12.30, N = 3SE +/- 17.93, N = 3SE +/- 16.84, N = 3SE +/- 12.84, N = 3SE +/- 14.64, N = 3SE +/- 5.76, N = 3SE +/- 3.24, N = 32642.12716.22705.02727.82718.02723.72736.02662.02664.32710.82700.42728.12707.22726.62725.02731.42721.22726.82706.6

FFTW

Build: Float + SSE - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 10242 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug3K6K9K12K15KSE +/- 40.01, N = 3SE +/- 69.17, N = 3SE +/- 43.51, N = 3SE +/- 63.84, N = 3SE +/- 72.25, N = 3SE +/- 81.04, N = 3SE +/- 76.28, N = 3SE +/- 124.86, N = 3SE +/- 58.86, N = 3SE +/- 58.30, N = 3SE +/- 74.77, N = 3SE +/- 76.29, N = 3SE +/- 15.68, N = 3SE +/- 48.60, N = 3SE +/- 70.39, N = 3SE +/- 81.50, N = 3SE +/- 44.50, N = 3SE +/- 46.46, N = 3SE +/- 644.68, N = 1211906.012218.012064.012213.011990.012316.012221.011903.011949.012125.012132.012202.012298.012337.012182.012242.012129.012131.010890.7

FFTW

Build: Stock - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 10242 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug7001400210028003500SE +/- 13.57, N = 3SE +/- 5.57, N = 3SE +/- 12.60, N = 3SE +/- 18.27, N = 3SE +/- 24.45, N = 3SE +/- 13.41, N = 3SE +/- 38.36, N = 3SE +/- 12.20, N = 3SE +/- 8.28, N = 3SE +/- 8.38, N = 3SE +/- 15.98, N = 3SE +/- 14.89, N = 3SE +/- 9.39, N = 3SE +/- 26.70, N = 3SE +/- 10.08, N = 3SE +/- 24.52, N = 3SE +/- 31.08, N = 3SE +/- 1.11, N = 3SE +/- 15.08, N = 33048.53142.03120.23138.13114.73091.63107.23053.13018.73126.03085.73123.43149.13110.33083.83118.23097.43133.03123.1

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 322 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug16003200480064008000SE +/- 64.34, N = 15SE +/- 65.67, N = 7SE +/- 82.98, N = 4SE +/- 12.13, N = 3SE +/- 72.31, N = 3SE +/- 79.17, N = 3SE +/- 35.65, N = 3SE +/- 24.17, N = 3SE +/- 71.46, N = 15SE +/- 98.33, N = 3SE +/- 54.90, N = 15SE +/- 76.45, N = 5SE +/- 57.27, N = 15SE +/- 52.52, N = 15SE +/- 90.54, N = 15SE +/- 14.66, N = 3SE +/- 45.97, N = 3SE +/- 56.02, N = 15SE +/- 5.79, N = 36925.77205.17243.17397.67316.47311.47347.46805.06912.77355.97126.77252.47194.97116.07041.37420.87191.37166.17287.0

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 40962 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug3K6K9K12K15KSE +/- 82.03, N = 3SE +/- 46.84, N = 3SE +/- 41.07, N = 3SE +/- 56.04, N = 3SE +/- 25.71, N = 3SE +/- 80.21, N = 3SE +/- 135.86, N = 3SE +/- 19.46, N = 3SE +/- 55.79, N = 3SE +/- 55.00, N = 3SE +/- 89.82, N = 3SE +/- 62.13, N = 3SE +/- 94.63, N = 3SE +/- 123.09, N = 3SE +/- 94.10, N = 3SE +/- 111.29, N = 3SE +/- 9.50, N = 3SE +/- 33.27, N = 3SE +/- 193.73, N = 314259143631428914438144341418614353142431431714279139791423414430143371418514295143221437514091

FFTW

Build: Float + SSE - Size: 2D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 642 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug3K6K9K12K15KSE +/- 70.91, N = 3SE +/- 50.86, N = 3SE +/- 125.58, N = 15SE +/- 181.67, N = 3SE +/- 39.26, N = 3SE +/- 70.08, N = 3SE +/- 133.36, N = 3SE +/- 41.97, N = 3SE +/- 55.77, N = 3SE +/- 159.78, N = 15SE +/- 39.26, N = 3SE +/- 23.46, N = 3SE +/- 86.00, N = 3SE +/- 136.22, N = 15SE +/- 82.53, N = 3SE +/- 148.65, N = 7SE +/- 107.04, N = 3SE +/- 137.71, N = 15SE +/- 26.91, N = 313636160351584215906161101587815952135021374015388161411598915845157821598115787159941561114982

FFTW

Build: Float + SSE - Size: 2D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 5122 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug3K6K9K12K15KSE +/- 37.37, N = 3SE +/- 40.29, N = 3SE +/- 52.21, N = 3SE +/- 64.51, N = 3SE +/- 41.82, N = 3SE +/- 84.01, N = 3SE +/- 19.97, N = 3SE +/- 64.54, N = 3SE +/- 61.50, N = 3SE +/- 83.95, N = 3SE +/- 57.89, N = 3SE +/- 93.72, N = 3SE +/- 39.68, N = 3SE +/- 128.25, N = 3SE +/- 36.67, N = 3SE +/- 106.71, N = 3SE +/- 74.64, N = 3SE +/- 60.34, N = 3SE +/- 97.11, N = 311715118411181811652118031180311750117141177811821118931183611709117801185811693118061182111870

FFTW

Build: Float + SSE - Size: 2D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 1282 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug3K6K9K12K15KSE +/- 82.96, N = 15SE +/- 95.45, N = 3SE +/- 105.79, N = 7SE +/- 91.43, N = 10SE +/- 86.04, N = 3SE +/- 95.52, N = 3SE +/- 92.80, N = 3SE +/- 113.85, N = 15SE +/- 50.90, N = 3SE +/- 94.72, N = 15SE +/- 147.41, N = 3SE +/- 48.56, N = 3SE +/- 104.94, N = 8SE +/- 111.70, N = 3SE +/- 47.40, N = 3SE +/- 62.19, N = 3SE +/- 136.55, N = 3SE +/- 101.06, N = 3SE +/- 84.15, N = 1212295117391179511796119911174311952121881259111788117811160911943115781193411726115771182011729

FFTW

Build: Stock - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 40962 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug8001600240032004000SE +/- 8.30, N = 3SE +/- 9.96, N = 3SE +/- 8.49, N = 3SE +/- 4.58, N = 3SE +/- 0.15, N = 3SE +/- 8.63, N = 3SE +/- 13.90, N = 3SE +/- 1.10, N = 3SE +/- 13.95, N = 3SE +/- 11.64, N = 3SE +/- 4.73, N = 3SE +/- 4.75, N = 3SE +/- 5.34, N = 3SE +/- 7.30, N = 3SE +/- 15.54, N = 3SE +/- 16.37, N = 3SE +/- 10.45, N = 3SE +/- 5.80, N = 3SE +/- 17.36, N = 33907.03936.03950.03924.23934.43895.43912.13909.53917.43931.33920.73923.93921.73911.53919.33932.33927.13934.63914.8

FFTW

Build: Stock - Size: 2D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 5122 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug8001600240032004000SE +/- 41.40, N = 5SE +/- 5.58, N = 3SE +/- 6.57, N = 3SE +/- 20.08, N = 3SE +/- 14.05, N = 3SE +/- 13.13, N = 3SE +/- 10.93, N = 3SE +/- 38.03, N = 5SE +/- 14.36, N = 3SE +/- 16.55, N = 3SE +/- 3.13, N = 3SE +/- 6.77, N = 3SE +/- 7.10, N = 3SE +/- 34.42, N = 12SE +/- 9.69, N = 3SE +/- 14.01, N = 3SE +/- 11.78, N = 3SE +/- 21.49, N = 3SE +/- 10.97, N = 33736.53767.93761.93769.93776.93773.33797.53679.23751.13775.93785.13780.63784.43748.43759.43768.03776.03780.83780.7

FFTW

Build: Float + SSE - Size: 1D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 20482 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug3K6K9K12K15KSE +/- 107.49, N = 3SE +/- 20.00, N = 3SE +/- 76.47, N = 3SE +/- 42.13, N = 3SE +/- 20.01, N = 3SE +/- 75.96, N = 3SE +/- 78.24, N = 3SE +/- 24.95, N = 3SE +/- 15.63, N = 3SE +/- 53.18, N = 3SE +/- 16.00, N = 3SE +/- 58.95, N = 3SE +/- 33.86, N = 3SE +/- 37.49, N = 3SE +/- 27.28, N = 3SE +/- 143.99, N = 3SE +/- 130.85, N = 3SE +/- 97.36, N = 3SE +/- 41.31, N = 315539154281542615564155711546115457155431555115407153891556815538155261563115367153381548115513

FFTW

Build: Stock - Size: 1D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 1282 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug10002000300040005000SE +/- 3.10, N = 3SE +/- 1.52, N = 3SE +/- 52.07, N = 15SE +/- 5.65, N = 3SE +/- 51.58, N = 15SE +/- 2.46, N = 3SE +/- 35.10, N = 9SE +/- 1.43, N = 3SE +/- 2.94, N = 3SE +/- 4.02, N = 3SE +/- 9.28, N = 3SE +/- 4.33, N = 3SE +/- 2.25, N = 3SE +/- 2.45, N = 3SE +/- 2.14, N = 3SE +/- 6.19, N = 3SE +/- 3.05, N = 3SE +/- 1.58, N = 3SE +/- 6.43, N = 34354.34433.24327.34405.04294.04414.64379.54358.84364.24402.84427.24415.54415.34408.44417.14412.34405.54434.74420.6

FFTW

Build: Float + SSE - Size: 2D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 2562 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug2K4K6K8K10KSE +/- 119.21, N = 3SE +/- 40.76, N = 3SE +/- 34.04, N = 3SE +/- 84.51, N = 3SE +/- 65.80, N = 3SE +/- 85.30, N = 3SE +/- 40.95, N = 3SE +/- 115.75, N = 3SE +/- 21.39, N = 3SE +/- 55.86, N = 3SE +/- 60.95, N = 3SE +/- 108.39, N = 3SE +/- 140.84, N = 3SE +/- 35.14, N = 3SE +/- 91.42, N = 3SE +/- 68.55, N = 3SE +/- 73.06, N = 3SE +/- 62.20, N = 3SE +/- 86.71, N = 311087114111146711408115481139911511110941113711465114821135611337114401137811393114021148011414

FFTW

Build: Stock - Size: 1D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 20482 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug9001800270036004500SE +/- 11.79, N = 3SE +/- 47.88, N = 4SE +/- 14.25, N = 3SE +/- 12.48, N = 3SE +/- 8.15, N = 3SE +/- 6.84, N = 3SE +/- 8.12, N = 3SE +/- 18.65, N = 3SE +/- 6.58, N = 3SE +/- 4.44, N = 3SE +/- 16.83, N = 3SE +/- 14.17, N = 3SE +/- 22.16, N = 3SE +/- 14.04, N = 3SE +/- 7.07, N = 3SE +/- 4.42, N = 3SE +/- 6.35, N = 3SE +/- 9.60, N = 3SE +/- 6.96, N = 34046.43989.44042.04051.54053.44051.34055.04046.94031.34045.54034.44048.34037.24030.64039.54046.44034.34063.84055.6

FFTW

Build: Stock - Size: 2D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 1282 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug8001600240032004000SE +/- 9.25, N = 3SE +/- 21.71, N = 3SE +/- 4.10, N = 3SE +/- 17.74, N = 3SE +/- 30.58, N = 3SE +/- 7.88, N = 3SE +/- 3.71, N = 3SE +/- 31.53, N = 3SE +/- 21.66, N = 3SE +/- 11.59, N = 3SE +/- 29.76, N = 3SE +/- 14.94, N = 3SE +/- 28.74, N = 3SE +/- 18.17, N = 3SE +/- 4.22, N = 3SE +/- 24.20, N = 3SE +/- 2.08, N = 3SE +/- 29.21, N = 3SE +/- 4.06, N = 33848.03836.23805.33834.93824.53835.53861.63812.53839.43855.93837.33831.93821.53836.53836.43829.83865.13812.33855.8

FFTW

Build: Float + SSE - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 10242 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug3K6K9K12K15KSE +/- 112.78, N = 3SE +/- 78.73, N = 3SE +/- 15.17, N = 3SE +/- 75.24, N = 3SE +/- 129.47, N = 3SE +/- 67.72, N = 3SE +/- 80.68, N = 3SE +/- 43.66, N = 3SE +/- 32.22, N = 3SE +/- 82.71, N = 3SE +/- 21.01, N = 3SE +/- 85.56, N = 3SE +/- 94.45, N = 3SE +/- 73.63, N = 3SE +/- 141.62, N = 3SE +/- 64.06, N = 3SE +/- 135.76, N = 3SE +/- 78.81, N = 3SE +/- 80.85, N = 315517161671611916234159391614116059156351563016090161431609916169159621603716165159791622316266

FFTW

Build: Float + SSE - Size: 1D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 642 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug2K4K6K8K10KSE +/- 54.50, N = 3SE +/- 144.31, N = 15SE +/- 40.83, N = 3SE +/- 85.65, N = 3SE +/- 101.28, N = 5SE +/- 99.10, N = 3SE +/- 114.62, N = 3SE +/- 33.87, N = 3SE +/- 98.25, N = 3SE +/- 69.65, N = 13SE +/- 7.77, N = 3SE +/- 18.19, N = 3SE +/- 18.11, N = 3SE +/- 141.95, N = 3SE +/- 15.38, N = 3SE +/- 31.09, N = 3SE +/- 17.46, N = 3SE +/- 10.59, N = 3SE +/- 172.96, N = 1510229.09996.310372.010418.010279.510350.010470.010131.09964.410357.810315.010354.010397.010256.010321.010529.010598.010410.09870.6

FFTW

Build: Stock - Size: 1D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 2562 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug9001800270036004500SE +/- 27.37, N = 3SE +/- 24.27, N = 3SE +/- 19.87, N = 3SE +/- 5.17, N = 3SE +/- 17.44, N = 3SE +/- 1.57, N = 3SE +/- 15.53, N = 3SE +/- 1.60, N = 3SE +/- 24.49, N = 3SE +/- 3.18, N = 3SE +/- 43.31, N = 3SE +/- 22.07, N = 3SE +/- 17.30, N = 3SE +/- 8.48, N = 3SE +/- 53.04, N = 3SE +/- 2.74, N = 3SE +/- 12.05, N = 3SE +/- 10.82, N = 3SE +/- 1.72, N = 34316.14300.94313.44308.34317.04308.34290.54309.24312.84314.34241.64288.14329.54275.64247.74301.84335.84289.24296.2

FFTW

Build: Stock - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 10242 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug9001800270036004500SE +/- 59.35, N = 3SE +/- 48.65, N = 4SE +/- 8.12, N = 3SE +/- 9.92, N = 3SE +/- 7.01, N = 3SE +/- 2.87, N = 3SE +/- 19.30, N = 3SE +/- 8.01, N = 3SE +/- 25.83, N = 3SE +/- 14.59, N = 3SE +/- 4.55, N = 3SE +/- 4.77, N = 3SE +/- 32.58, N = 3SE +/- 11.99, N = 3SE +/- 4.50, N = 3SE +/- 6.86, N = 3SE +/- 26.82, N = 3SE +/- 10.52, N = 3SE +/- 5.54, N = 34178.44232.84290.64263.24286.74282.94270.04233.74198.14267.94288.44280.94252.94276.04286.34271.74265.54282.54285.6

FFTW

Build: Stock - Size: 2D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 2562 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug8001600240032004000SE +/- 23.89, N = 3SE +/- 47.63, N = 3SE +/- 12.33, N = 3SE +/- 33.28, N = 3SE +/- 23.72, N = 3SE +/- 13.35, N = 3SE +/- 7.11, N = 3SE +/- 17.09, N = 3SE +/- 37.88, N = 3SE +/- 12.43, N = 3SE +/- 35.49, N = 3SE +/- 16.18, N = 3SE +/- 33.79, N = 3SE +/- 34.93, N = 3SE +/- 34.82, N = 3SE +/- 4.70, N = 3SE +/- 20.14, N = 3SE +/- 50.06, N = 3SE +/- 23.34, N = 33634.93589.53654.93648.03610.33686.73693.33671.33616.93641.63583.53622.23655.83642.83673.03663.63629.73603.63660.6

FFTW

Build: Float + SSE - Size: 1D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 5122 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug3K6K9K12K15KSE +/- 127.67, N = 3SE +/- 89.29, N = 3SE +/- 109.69, N = 3SE +/- 71.36, N = 3SE +/- 163.42, N = 3SE +/- 81.91, N = 3SE +/- 120.34, N = 3SE +/- 157.18, N = 3SE +/- 66.70, N = 3SE +/- 127.81, N = 3SE +/- 45.83, N = 3SE +/- 195.78, N = 3SE +/- 104.45, N = 3SE +/- 60.73, N = 3SE +/- 63.39, N = 3SE +/- 109.70, N = 3SE +/- 89.79, N = 3SE +/- 168.49, N = 3SE +/- 123.58, N = 315270157041557615612155451556015471152891540015538152101529415404154141557415380154031552515552

FFTW

Build: Stock - Size: 2D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 642 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug9001800270036004500SE +/- 5.04, N = 3SE +/- 51.49, N = 4SE +/- 2.62, N = 3SE +/- 22.90, N = 3SE +/- 60.33, N = 3SE +/- 4.40, N = 3SE +/- 46.52, N = 4SE +/- 4.58, N = 3SE +/- 7.74, N = 3SE +/- 61.56, N = 3SE +/- 3.34, N = 3SE +/- 22.79, N = 3SE +/- 18.00, N = 3SE +/- 4.27, N = 3SE +/- 21.71, N = 3SE +/- 56.44, N = 3SE +/- 52.12, N = 4SE +/- 60.71, N = 3SE +/- 21.54, N = 34336.64206.74347.44316.44265.14336.74289.84337.14354.74286.44321.44320.84337.34318.64326.34260.54291.04280.94309.2

FFTW

Build: Stock - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 322 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug11002200330044005500SE +/- 4.74, N = 3SE +/- 13.08, N = 3SE +/- 26.35, N = 3SE +/- 31.02, N = 3SE +/- 2.77, N = 3SE +/- 8.35, N = 3SE +/- 23.10, N = 3SE +/- 17.04, N = 3SE +/- 95.20, N = 13SE +/- 15.07, N = 3SE +/- 18.68, N = 3SE +/- 31.02, N = 3SE +/- 13.97, N = 3SE +/- 25.51, N = 3SE +/- 39.97, N = 3SE +/- 1.59, N = 3SE +/- 24.70, N = 3SE +/- 15.16, N = 3SE +/- 29.89, N = 35024.64997.45019.85005.75021.34967.15020.85008.94898.05000.05008.84957.94990.14962.74963.85027.45004.94979.94998.5

FFTW

Build: Float + SSE - Size: 1D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 1282 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug3K6K9K12K15KSE +/- 136.77, N = 3SE +/- 79.22, N = 3SE +/- 52.35, N = 3SE +/- 169.95, N = 3SE +/- 46.51, N = 3SE +/- 160.95, N = 3SE +/- 33.80, N = 3SE +/- 84.00, N = 3SE +/- 80.49, N = 3SE +/- 132.00, N = 5SE +/- 120.91, N = 3SE +/- 133.00, N = 3SE +/- 67.87, N = 3SE +/- 45.94, N = 3SE +/- 130.44, N = 3SE +/- 93.99, N = 3SE +/- 165.41, N = 3SE +/- 90.08, N = 3SE +/- 136.03, N = 1512740128311314612908132191302613206127891285013167131431307213000132451305813031130951301912702

FFTW

Build: Float + SSE - Size: 1D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 2562 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug3K6K9K12K15KSE +/- 67.57, N = 3SE +/- 150.68, N = 4SE +/- 169.36, N = 3SE +/- 71.88, N = 3SE +/- 146.24, N = 4SE +/- 30.75, N = 3SE +/- 28.45, N = 3SE +/- 68.09, N = 3SE +/- 58.29, N = 3SE +/- 145.87, N = 3SE +/- 127.87, N = 3SE +/- 145.93, N = 4SE +/- 165.24, N = 3SE +/- 115.02, N = 3SE +/- 36.46, N = 3SE +/- 137.11, N = 3SE +/- 169.66, N = 3SE +/- 101.06, N = 3SE +/- 98.34, N = 313831140441412414150140091398914032136511379514247142661412513997144551398314328143651379413679

FFTW

Build: Stock - Size: 1D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 5122 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug9001800270036004500SE +/- 22.16, N = 3SE +/- 23.58, N = 3SE +/- 15.02, N = 3SE +/- 7.37, N = 3SE +/- 10.16, N = 3SE +/- 12.50, N = 3SE +/- 15.04, N = 3SE +/- 2.84, N = 3SE +/- 11.47, N = 3SE +/- 8.76, N = 3SE +/- 13.39, N = 3SE +/- 3.08, N = 3SE +/- 10.85, N = 3SE +/- 27.65, N = 3SE +/- 4.56, N = 3SE +/- 32.02, N = 3SE +/- 18.08, N = 3SE +/- 16.20, N = 3SE +/- 16.10, N = 34285.84298.44352.14334.24332.94344.44345.74340.14314.44366.84338.54340.04367.84334.24346.84319.94354.24350.44320.4

FFTW

Build: Float + SSE - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 322 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug4K8K12K16K20KSE +/- 107.35, N = 3SE +/- 87.96, N = 3SE +/- 194.05, N = 3SE +/- 197.17, N = 3SE +/- 81.03, N = 3SE +/- 136.41, N = 3SE +/- 139.56, N = 3SE +/- 131.37, N = 3SE +/- 102.53, N = 3SE +/- 142.90, N = 3SE +/- 33.40, N = 3SE +/- 42.15, N = 3SE +/- 23.39, N = 3SE +/- 64.02, N = 3SE +/- 92.64, N = 3SE +/- 123.12, N = 3SE +/- 107.68, N = 3SE +/- 72.72, N = 3SE +/- 98.77, N = 313394166401631116344165541654816526134711341416344163481666816519164441637316548164481648716440

FFTW

Build: Stock - Size: 1D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 642 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug11002200330044005500SE +/- 7.41, N = 3SE +/- 14.95, N = 3SE +/- 6.59, N = 3SE +/- 2.54, N = 3SE +/- 0.13, N = 3SE +/- 3.97, N = 3SE +/- 5.21, N = 3SE +/- 61.09, N = 3SE +/- 4.42, N = 3SE +/- 11.20, N = 3SE +/- 10.53, N = 3SE +/- 7.40, N = 3SE +/- 8.65, N = 3SE +/- 6.01, N = 3SE +/- 3.52, N = 3SE +/- 15.45, N = 3SE +/- 7.46, N = 3SE +/- 4.79, N = 3SE +/- 2.78, N = 34899.54885.04899.64894.24891.04907.74897.94846.94899.94902.94893.84892.04899.14911.44905.74899.64893.44914.14906.8

FFTW

Build: Stock - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 322 x Intel Xeon E5-2620 v2GVNO2GVNO3NewGVNO2NewGVNO333NoGVNO2NoGVNO3OptNoSimplO2OptNoSimplO3OptPREO2OptPREO3OptRedO2OptRedO3OptSimplO2OptSimplO3PessimisticNewGVNO2PessimisticO2PessimisticO3debug11002200330044005500SE +/- 6.03, N = 3SE +/- 33.03, N = 3SE +/- 39.92, N = 3SE +/- 1.29, N = 3SE +/- 11.03, N = 3SE +/- 12.52, N = 3SE +/- 6.35, N = 3SE +/- 43.81, N = 3SE +/- 41.81, N = 3SE +/- 55.21, N = 3SE +/- 5.69, N = 3SE +/- 32.60, N = 3SE +/- 11.16, N = 3SE +/- 6.97, N = 3SE +/- 12.54, N = 3SE +/- 33.93, N = 3SE +/- 11.17, N = 3SE +/- 36.57, N = 3SE +/- 7.51, N = 35115.05179.15176.45213.25180.45211.55219.05127.95144.65144.85209.05189.35220.55217.95213.45173.55201.15176.15218.7


Phoronix Test Suite v10.8.5