fftw-1.2.0run

2 x Intel Xeon E5-2620 v2 testing with an ASUS Z9PE-D8 WS (5503 BIOS) and ASPEED on CentOS Stream 9 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2411280-NE-FFTW120RU64&grr&rdt.

System details

Result identifiers: debug, NoGVNO2, NoGVNO3, OptNoSimplO2, 2 x Intel Xeon E5-2620 v2, OptNoSimplO3, OptSimplO2, OptSimplO3, OptRedO2, OptRedO3, OptPREO2, GVNO2, NewGVNO2, OptPREO3, NewGVNO333, GVNO3, PessimisticO3, PessimisticO2, PessimisticNewGVNO2, PessimisticNewGVNO3, NewGVNO2-debug, rebase

Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads)
Motherboard: ASUS Z9PE-D8 WS (5503 BIOS)
Chipset: Intel Xeon E7 v2/Xeon
Memory: 32GB
Disk: 256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00P
Graphics: ASPEED
Audio: Realtek ALC898
Network: 2 x Intel 82574L
OS: CentOS Stream 9
Kernel: 5.14.0-467.el9.x86_64 (x86_64)
Display Server: X Server
Display Driver: NVIDIA
Compiler: GCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2
File-System: ext4
Screen Resolution: 1024x768

Other values reported by later results:
Disk: 2000GB Western Digital WD20EARX-00P + 256GB Samsung SSD 850 (same drives, listed in the other order)
Monitor: ASUS VW190
Kernel: 5.14.0-474.el9.x86_64 (x86_64), 5.14.0-480.el9.x86_64 (x86_64), 5.14.0-496.el9.x86_64 (x86_64), 5.14.0-503.el9.x86_64 (x86_64), 5.14.0-514.el9.x86_64 (x86_64), 5.14.0-529.el9.x86_64 (x86_64)
Compiler: GCC 11.5.0 20240719 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2

Kernel Details
- Transparent Huge Pages: always

Environment Details
- debug: CXXFLAGS=-O2 CFLAGS=-O2
- NoGVNO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -disable-gvn=true" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -disable-gvn=true"
- NoGVNO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -disable-gvn=true" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -disable-gvn=true"
- OptNoSimplO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false"
- 2 x Intel Xeon E5-2620 v2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false"
- OptNoSimplO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false"
- OptSimplO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true"
- OptSimplO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true"
- OptRedO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false"
- OptRedO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true -mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false"
- OptPREO2: CXXFLAGS="-O2 -mllvm -enable-newgvn" CFLAGS="-O2 -mllvm -enable-newgvn"
- GVNO2: CXXFLAGS=-O2 CFLAGS=-O2
- NewGVNO2: CXXFLAGS="-O2 -mllvm -enable-newgvn" CFLAGS="-O2 -mllvm -enable-newgvn"
- OptPREO3: CXXFLAGS="-O3 -mllvm -enable-newgvn" CFLAGS="-O3 -mllvm -enable-newgvn"
- NewGVNO333: CXXFLAGS="-O3 -mllvm -enable-newgvn" CFLAGS="-O3 -mllvm -enable-newgvn"
- GVNO3: CXXFLAGS=-O3 CFLAGS=-O3
- PessimisticO3: CXXFLAGS=-O3 CFLAGS=-O3
- PessimisticO2: CXXFLAGS=-O2 CFLAGS=-O2
- PessimisticNewGVNO2: CXXFLAGS="-O2 -mllvm -enable-newgvn" CFLAGS="-O2 -mllvm -enable-newgvn"
- PessimisticNewGVNO3: CXXFLAGS="-O3 -mllvm -enable-newgvn" CFLAGS="-O3 -mllvm -enable-newgvn"
- NewGVNO2-debug: CXXFLAGS="-O2 -mllvm -enable-newgvn" CFLAGS="-O2 -mllvm -enable-newgvn"
- rebase: CXXFLAGS="-O2 -mllvm -enable-newgvn" CFLAGS="-O2 -mllvm -enable-newgvn"

Compiler Details
- Optimized build with assertions; Built Apr 11 2013 (07:43:48); Default target: i386-pc-linux-gnu; Host CPU: i686

Processor Details
- Scaling Governor: intel_cpufreq conservative
- CPU Microcode: 0x42e

Security Details
- gather_data_sampling: Not affected
- itlb_multihit: KVM: Mitigation of VMX disabled
- l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable
- mds: Mitigation of Clear buffers; SMT vulnerable
- meltdown: Mitigation of PTI
- mmio_stale_data: Unknown: No mitigations
- reg_file_data_sampling: Not affected
- retbleed: Not affected
- spec_rstack_overflow: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Retpolines; IBPB: conditional; IBRS_FW; STIBP: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected
- srbds: Not affected
- tsx_async_abort: Not affected
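Several Environment Details entries differ from one another in only one or two `-mllvm` options, which is hard to see by eye. The following is a minimal sketch, not part of the original result file, of how two flag strings could be compared. The helper names `parse_flags` and `diff_options` are hypothetical, and recording a bare option such as `-enable-newgvn` as "(set)" is a convention chosen here for illustration.

```python
import shlex

# Flag strings copied from the OptSimplO2 and OptRedO2 entries.
OPT_SIMPL_O2 = ("-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false "
                "-mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=true")
OPT_RED_O2 = ("-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=true "
              "-mllvm -enable-newgvn-simpl=true -mllvm -enable-newgvn-pre=false")


def parse_flags(flags):
    """Split a flag string into (optimization level, {-mllvm option: value})."""
    tokens = shlex.split(flags)
    opt_level = None
    options = {}
    i = 0
    while i < len(tokens):
        token = tokens[i]
        if token == "-mllvm" and i + 1 < len(tokens):
            # The token after -mllvm is "-name" or "-name=value".
            name, _, value = tokens[i + 1].lstrip("-").partition("=")
            options[name] = value or "(set)"  # bare option: illustrative marker
            i += 2
        else:
            if token.startswith("-O"):
                opt_level = token
            i += 1
    return opt_level, options


def diff_options(a, b):
    """Return {option: (value_in_a, value_in_b)} for options that differ."""
    return {k: (a.get(k), b.get(k))
            for k in sorted(set(a) | set(b))
            if a.get(k) != b.get(k)}


level_a, opts_a = parse_flags(OPT_SIMPL_O2)
level_b, opts_b = parse_flags(OPT_RED_O2)
print(level_a, level_b)
print(diff_options(opts_a, opts_b))
```

For these two entries the comparison reports a single difference, `enable-phi-of-ops` (false versus true). The order in which options appear does not affect the result.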

fftw-1.2.0run result overview

Columns, in order:
 1. fftw: Float + SSE - 2D FFT Size 4096
 2. fftw: Stock - 2D FFT Size 4096
 3. fftw: Float + SSE - 2D FFT Size 2048
 4. fftw: Stock - 2D FFT Size 2048
 5. fftw: Float + SSE - 2D FFT Size 1024
 6. fftw: Stock - 2D FFT Size 1024
 7. fftw: Float + SSE - 1D FFT Size 4096
 8. fftw: Float + SSE - 1D FFT Size 32
 9. fftw: Float + SSE - 2D FFT Size 512
10. fftw: Stock - 1D FFT Size 4096
11. fftw: Float + SSE - 2D FFT Size 64
12. fftw: Float + SSE - 2D FFT Size 128
13. fftw: Stock - 1D FFT Size 128
14. fftw: Stock - 2D FFT Size 512
15. fftw: Float + SSE - 1D FFT Size 2048
16. fftw: Float + SSE - 2D FFT Size 256
17. fftw: Stock - 1D FFT Size 2048
18. fftw: Stock - 2D FFT Size 128
19. fftw: Float + SSE - 1D FFT Size 64
20. fftw: Float + SSE - 1D FFT Size 1024
21. fftw: Stock - 1D FFT Size 256
22. fftw: Stock - 1D FFT Size 1024
23. fftw: Float + SSE - 1D FFT Size 256
24. fftw: Stock - 2D FFT Size 256
25. fftw: Float + SSE - 1D FFT Size 512
26. fftw: Stock - 2D FFT Size 64
27. fftw: Float + SSE - 1D FFT Size 128
28. fftw: Stock - 1D FFT Size 32
29. fftw: Stock - 1D FFT Size 512
30. fftw: Float + SSE - 2D FFT Size 32
31. fftw: Stock - 1D FFT Size 64
32. fftw: Stock - 2D FFT Size 32

Configuration                    1       2       3       4       5       6       7       8       9      10      11      12      13      14      15      16      17      18      19      20      21      22      23      24      25      26      27      28      29      30      31      32
debug                       6092.7  2751.4  6459.1  2706.6 10890.7  3123.1   14091  7287.0   11870  3914.8   14982   11729  4420.6  3780.7   15513   11414  4055.6  3855.8  9870.6   16266  4296.2  4285.6   13679  3660.6   15552  4309.2   12702  4998.5  4320.4   16440  4906.8  5218.7
NoGVNO2                     6717.9  2756.0  7027.0  2723.7   12316  3091.6   14186  7311.4   11803  3895.4   15878   11743  4414.6  3773.3   15461   11399  4051.3  3835.5   10350   16141  4308.3  4282.9   13989  3686.7   15560  4336.7   13026  4967.1  4344.4   16548  4907.7  5211.5
NoGVNO3                     6717.1  2738.9  7077.7  2736.0   12221  3107.2   14353  7347.4   11750  3912.1   15952   11952  4379.5  3797.5   15457   11511  4055.0  3861.6   10470   16059  4290.5  4270.0   14032  3693.3   15471  4289.8   13206  5020.8  4345.7   16526  4897.9  5219.0
OptNoSimplO2                5603.7  2231.1  6728.2  2662.0   11903  3053.1   14243  6805.0   11714  3909.5   13502   12188  4358.8  3679.2   15543   11094  4046.9  3812.5   10131   15635  4309.2  4233.7   13651  3671.3   15289  4337.1   12789  5008.9  4340.1   13471  4846.9  5127.9
2 x Intel Xeon E5-2620 v2   5618.5  2239.3  6665.9  2642.1   11906  3048.5   14259  6925.7   11715  3907.0   13636   12295  4354.3  3736.5   15539   11087  4046.4  3848.0   10229   15517  4316.1  4178.4   13831  3634.9   15270  4336.6   12740  5024.6  4285.8   13394  4899.5  5115.0
OptNoSimplO3                5634.2  2200.0  6731.1  2664.3   11949  3018.7   14317  6912.7   11778  3917.4   13740   12591  4364.2  3751.1   15551   11137  4031.3  3839.4  9964.4   15630  4312.8  4198.1   13795  3616.9   15400  4354.7   12850  4898.0  4314.4   13414  4899.9  5144.6
OptSimplO2                  6694.6  2757.9  7024.0  2726.6   12337  3110.3   14337  7116.0   11780  3911.5   15782   11578  4408.4  3748.4   15526   11440  4030.6  3836.5   10256   15962  4275.6  4276.0   14455  3642.8   15414  4318.6   13245  4962.7  4334.2   16444  4911.4  5217.9
OptSimplO3                  6707.9  2758.9  7072.1  2725.0   12182  3083.8   14185  7041.3   11858  3919.3   15981   11934  4417.1  3759.4   15631   11378  4039.5  3836.4   10321   16037  4247.7  4286.3   13983  3673.0   15574  4326.3   13058  4963.8  4346.8   16373  4905.7  5213.4
OptRedO2                    6697.3  2773.9  7021.6  2728.1   12202  3123.4   14234  7252.4   11836  3923.9   15989   11609  4415.5  3780.6   15568   11356  4048.3  3831.9   10354   16099  4288.1  4280.9   14125  3622.2   15294  4320.8   13072  4957.9  4340.0   16668  4892.0  5189.3
OptRedO3                    6718.8  2767.1  7059.0  2707.2   12298  3149.1   14430  7194.9   11709  3921.7   15845   11943  4415.3  3784.4   15538   11337  4037.2  3821.5   10397   16169  4329.5  4252.9   13997  3655.8   15404  4337.3   13000  4990.1  4367.8   16519  4899.1  5220.5
OptPREO2                    6688.4  2762.4  7005.5  2710.8   12125  3126.0   14279  7355.9   11821  3931.3   15388   11788  4402.8  3775.9   15407   11465  4045.5  3855.9 10357.8   16090  4314.3  4267.9   14247  3641.6   15538  4286.4   13167  5000.0  4366.8   16344  4902.9  5144.8
GVNO2                       6703.2  2755.0  6983.6  2716.2   12218  3142.0   14363  7205.1   11841  3936.0   16035   11739  4433.2  3767.9   15428   11411  3989.4  3836.2  9996.3   16167  4300.9  4232.8   14044  3589.5   15704  4206.7   12831  4997.4  4298.4   16640  4885.0  5179.1
NewGVNO2                    6697.2  2752.9  6988.0  2727.8   12213  3138.1   14438  7397.6   11652  3924.2   15906   11796  4405.0  3769.9   15564   11408  4051.5  3834.9   10418   16234  4308.3  4263.2   14150  3648.0   15612  4316.4   12908  5005.7  4334.2   16344  4894.2  5213.2
OptPREO3                    6707.0  2755.4  6992.7  2700.4   12132  3085.7   13979  7126.7   11893  3920.7   16141   11781  4427.2  3785.1   15389   11482  4034.4  3837.3   10315   16143  4241.6  4288.4   14266  3583.5   15210  4321.4   13143  5008.8  4338.5   16348  4893.8  5209.0
NewGVNO333                  6732.2  2745.8  7080.8  2718.0   11990  3114.7   14434  7316.4   11803  3934.4   16110   11991  4294.0  3776.9   15571   11548  4053.4  3824.5 10279.5   15939  4317.0  4286.7   14009  3610.3   15545  4265.1   13219  5021.3  4332.9   16554  4891.0  5180.4
GVNO3                       6724.5  2779.8  7049.2  2705.0   12064  3120.2   14289  7243.1   11818  3950.0   15842   11795  4327.3  3761.9   15426   11467  4042.0  3805.3   10372   16119  4313.4  4290.6   14124  3654.9   15576  4347.4   13146  5019.8  4352.1   16311  4899.6  5176.4
PessimisticO3               6716.1  2774.8  7085.3  2726.8   12131  3133.0   14375  7166.1   11821  3934.6   15611   11820  4434.7  3780.8   15481   11480  4063.8  3812.3   10410   16223  4289.2  4282.5   13794  3603.6   15525  4280.9   13019  4979.9  4350.4   16487  4914.1  5176.1
PessimisticO2               6692.2  2766.5  7027.5  2721.2   12129  3097.4   14322  7191.3   11806  3927.1   15994   11577  4405.5  3776.0   15338   11402  4034.3  3865.1   10598   15979  4335.8  4265.5   14365  3629.7   15403  4291.0   13095  5004.9  4354.2   16448  4893.4  5201.1
PessimisticNewGVNO2         6718.4  2758.0  7001.6  2731.4   12242  3118.2   14295  7420.8   11693  3932.3   15787   11726  4412.3  3768.0   15367   11393  4046.4  3829.8   10529   16165  4301.8  4271.7   14328  3663.6   15380  4260.5   13031  5027.4  4319.9   16548  4899.6  5173.5
PessimisticNewGVNO3         6710.8  2744.4  7068.9  2707.5   12178  3134.6   14444  7379.9   11838  3949.1   15873   11746  4295.7  3772.5   15556   11434  4049.1  3855.4 10378.0   16065  4293.2  4276.9   14091  3643.3   15508  4287.6   13238  5020.0  4332.4   16378  4884.4  5208.7
NewGVNO2-debug              3177.7  2801.5  3286.8  2725.8  4191.7  3116.8  4298.5  5044.3  4134.8  3920.9  5134.1  4335.8  4406.1  3768.5  4427.5  4121.4  4045.0  3868.5  5058.9  4548.8  4305.0  4248.9  4442.7  3684.2  4454.7  4216.3  4526.9  5009.9  4278.3  5238.9  4897.4  5196.9
rebase                      3189.8  2766.1  3281.8  2718.0  4198.9  3083.7  4290.1  5010.3  4126.2  3934.6  5137.8  4309.2  4357.8  3767.6  4430.6  4126.7  4059.6  3848.2  5116.7  4528.8  4274.2  4271.5  4525.9  3649.5  4503.6  4336.3  4530.3  5021.8  4318.7  5227.5  4900.8  5220.4

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                        6092.7   262.37    9
NoGVNO2                      6717.9    13.11    3
NoGVNO3                      6717.1    21.46    3
OptNoSimplO2                 5603.7     4.28    3
2 x Intel Xeon E5-2620 v2    5618.5     8.82    3
OptNoSimplO3                 5634.2     7.27    3
OptSimplO2                   6694.6     9.08    3
OptSimplO3                   6707.9    11.45    3
OptRedO2                     6697.3     4.50    3
OptRedO3                     6718.8    15.62    3
OptPREO2                     6688.4    15.10    3
GVNO2                        6703.2     5.39    3
NewGVNO2                     6697.2    10.55    3
OptPREO3                     6707.0    20.27    3
NewGVNO333                   6732.2    23.42    3
GVNO3                        6724.5    23.90    3
PessimisticO3                6716.1     8.99    3
PessimisticO2                6692.2    22.81    3
PessimisticNewGVNO2          6718.4     6.00    3
PessimisticNewGVNO3          6710.8    15.73    3
NewGVNO2-debug               3177.7     8.33    3
rebase                       3189.8     3.21    3
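Each result is reported as a mean with "SE +/-" and a run count N, so a difference between two configurations is only meaningful relative to those errors. The sketch below is an illustration rather than part of the exported results. It assumes that "SE" is the standard error of the mean (standard deviation divided by the square root of N), that the values and SE entries appear in the same order as the configuration names, and that a difference larger than two combined standard errors is a reasonable rule of thumb for "beyond noise". All function names are hypothetical.

```python
import math

# (mean Mflops, SE, N) copied from the Float + SSE, 2D FFT Size 4096 results.
RESULTS = {
    "debug": (6092.7, 262.37, 9),
    "OptNoSimplO2": (5603.7, 4.28, 3),
    "GVNO2": (6703.2, 5.39, 3),
    "NewGVNO2": (6697.2, 10.55, 3),
}


def percent_change(baseline, candidate):
    """Relative change of candidate versus baseline, in percent."""
    return 100.0 * (candidate - baseline) / baseline


def combined_se(se_a, se_b):
    """Standard error of the difference of two independent means."""
    return math.sqrt(se_a ** 2 + se_b ** 2)


def run_std(se, n):
    """Run-to-run standard deviation, assuming SE = std / sqrt(n)."""
    return se * math.sqrt(n)


def exceeds_noise(mean_a, se_a, mean_b, se_b, k=2.0):
    """True if the means differ by more than k combined standard errors."""
    return abs(mean_b - mean_a) > k * combined_se(se_a, se_b)


base_mean, base_se, _ = RESULTS["GVNO2"]
for name in ("NewGVNO2", "OptNoSimplO2"):
    mean, se, _ = RESULTS[name]
    print(f"{name}: {percent_change(base_mean, mean):+.2f}% vs GVNO2, "
          f"beyond noise: {exceeds_noise(base_mean, base_se, mean, se)}")

debug_mean, debug_se, debug_n = RESULTS["debug"]
print(f"debug run-to-run std: {run_std(debug_se, debug_n):.1f} Mflops")
```

Under these assumptions, NewGVNO2 is within noise of GVNO2 on this test, while OptNoSimplO2 is about 16% lower, far outside the combined error. The large SE and N = 9 for debug indicate that the test suite repeated that run more often because of its high variance.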

FFTW

Build: Stock - Size: 2D FFT Size 4096

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                        2751.4     5.74    3
NoGVNO2                      2756.0    14.23    3
NoGVNO3                      2738.9    16.56    3
OptNoSimplO2                 2231.1    16.21    3
2 x Intel Xeon E5-2620 v2    2239.3     8.09    3
OptNoSimplO3                 2200.0    21.99   15
OptSimplO2                   2757.9     8.89    3
OptSimplO3                   2758.9     5.14    3
OptRedO2                     2773.9    15.05    3
OptRedO3                     2767.1    13.13    3
OptPREO2                     2762.4    11.48    3
GVNO2                        2755.0     5.66    3
NewGVNO2                     2752.9     8.68    3
OptPREO3                     2755.4     7.45    3
NewGVNO333                   2745.8    13.80    3
GVNO3                        2779.8    19.40    3
PessimisticO3                2774.8    23.10    3
PessimisticO2                2766.5    23.64    3
PessimisticNewGVNO2          2758.0     7.03    3
PessimisticNewGVNO3          2744.4    10.94    3
NewGVNO2-debug               2801.5    17.44    3
rebase                       2766.1    36.78    3

FFTW

Build: Float + SSE - Size: 2D FFT Size 2048

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                        6459.1    44.49    3
NoGVNO2                      7027.0    11.97    3
NoGVNO3                      7077.7     8.17    3
OptNoSimplO2                 6728.2    33.67    3
2 x Intel Xeon E5-2620 v2    6665.9    41.03    3
OptNoSimplO3                 6731.1    25.37    3
OptSimplO2                   7024.0    15.79    3
OptSimplO3                   7072.1    14.30    3
OptRedO2                     7021.6    19.13    3
OptRedO3                     7059.0    28.87    3
OptPREO2                     7005.5    26.38    3
GVNO2                        6983.6    18.93    3
NewGVNO2                     6988.0    12.08    3
OptPREO3                     6992.7    43.40    3
NewGVNO333                   7080.8     2.80    3
GVNO3                        7049.2     2.26    3
PessimisticO3                7085.3    10.65    3
PessimisticO2                7027.5    24.24    3
PessimisticNewGVNO2          7001.6    48.51    3
PessimisticNewGVNO3          7068.9    14.84    3
NewGVNO2-debug               3286.8     7.16    3
rebase                       3281.8     7.76    3

FFTW

Build: Stock - Size: 2D FFT Size 2048

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                        2706.6     3.24    3
NoGVNO2                      2723.7     8.21    3
NoGVNO3                      2736.0    12.56    3
OptNoSimplO2                 2662.0     6.15    3
2 x Intel Xeon E5-2620 v2    2642.1     5.49    3
OptNoSimplO3                 2664.3     6.56    3
OptSimplO2                   2726.6    17.93    3
OptSimplO3                   2725.0    16.84    3
OptRedO2                     2728.1    16.99    3
OptRedO3                     2707.2    12.30    3
OptPREO2                     2710.8    13.94    3
GVNO2                        2716.2    13.80    3
NewGVNO2                     2727.8    19.22    3
OptPREO3                     2700.4    11.61    3
NewGVNO333                   2718.0     6.49    3
GVNO3                        2705.0    18.48    3
PessimisticO3                2726.8     5.76    3
PessimisticO2                2721.2    14.64    3
PessimisticNewGVNO2          2731.4    12.84    3
PessimisticNewGVNO3          2707.5     4.65    3
NewGVNO2-debug               2725.8     3.87    3
rebase                       2718.0    14.16    3

FFTW

Build: Float + SSE - Size: 2D FFT Size 1024

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                       10890.7   644.68   12
NoGVNO2                     12316.0    81.04    3
NoGVNO3                     12221.0    76.28    3
OptNoSimplO2                11903.0   124.86    3
2 x Intel Xeon E5-2620 v2   11906.0    40.01    3
OptNoSimplO3                11949.0    58.86    3
OptSimplO2                  12337.0    48.60    3
OptSimplO3                  12182.0    70.39    3
OptRedO2                    12202.0    76.29    3
OptRedO3                    12298.0    15.68    3
OptPREO2                    12125.0    58.30    3
GVNO2                       12218.0    69.17    3
NewGVNO2                    12213.0    63.84    3
OptPREO3                    12132.0    74.77    3
NewGVNO333                  11990.0    72.25    3
GVNO3                       12064.0    43.51    3
PessimisticO3               12131.0    46.46    3
PessimisticO2               12129.0    44.50    3
PessimisticNewGVNO2         12242.0    81.50    3
PessimisticNewGVNO3         12178.0    55.07    3
NewGVNO2-debug               4191.7     4.89    3
rebase                       4198.9     7.33    3

FFTW

Build: Stock - Size: 2D FFT Size 1024

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                        3123.1    15.08    3
NoGVNO2                      3091.6    13.41    3
NoGVNO3                      3107.2    38.36    3
OptNoSimplO2                 3053.1    12.20    3
2 x Intel Xeon E5-2620 v2    3048.5    13.57    3
OptNoSimplO3                 3018.7     8.28    3
OptSimplO2                   3110.3    26.70    3
OptSimplO3                   3083.8    10.08    3
OptRedO2                     3123.4    14.89    3
OptRedO3                     3149.1     9.39    3
OptPREO2                     3126.0     8.38    3
GVNO2                        3142.0     5.57    3
NewGVNO2                     3138.1    18.27    3
OptPREO3                     3085.7    15.98    3
NewGVNO333                   3114.7    24.45    3
GVNO3                        3120.2    12.60    3
PessimisticO3                3133.0     1.11    3
PessimisticO2                3097.4    31.08    3
PessimisticNewGVNO2          3118.2    24.52    3
PessimisticNewGVNO3          3134.6    17.57    3
NewGVNO2-debug               3116.8    23.65    3
rebase                       3083.7    25.89    3

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                       14091.0   193.73    3
NoGVNO2                     14186.0    80.21    3
NoGVNO3                     14353.0   135.86    3
OptNoSimplO2                14243.0    19.46    3
2 x Intel Xeon E5-2620 v2   14259.0    82.03    3
OptNoSimplO3                14317.0    55.79    3
OptSimplO2                  14337.0   123.09    3
OptSimplO3                  14185.0    94.10    3
OptRedO2                    14234.0    62.13    3
OptRedO3                    14430.0    94.63    3
OptPREO2                    14279.0    55.00    3
GVNO2                       14363.0    46.84    3
NewGVNO2                    14438.0    56.04    3
OptPREO3                    13979.0    89.82    3
NewGVNO333                  14434.0    25.71    3
GVNO3                       14289.0    41.07    3
PessimisticO3               14375.0    33.27    3
PessimisticO2               14322.0     9.50    3
PessimisticNewGVNO2         14295.0   111.29    3
PessimisticNewGVNO3         14444.0   110.96    3
NewGVNO2-debug               4298.5     4.59    3
rebase                       4290.1     4.76    3

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                        7287.0     5.79    3
NoGVNO2                      7311.4    79.17    3
NoGVNO3                      7347.4    35.65    3
OptNoSimplO2                 6805.0    24.17    3
2 x Intel Xeon E5-2620 v2    6925.7    64.34   15
OptNoSimplO3                 6912.7    71.46   15
OptSimplO2                   7116.0    52.52   15
OptSimplO3                   7041.3    90.54   15
OptRedO2                     7252.4    76.45    5
OptRedO3                     7194.9    57.27   15
OptPREO2                     7355.9    98.33    3
GVNO2                        7205.1    65.67    7
NewGVNO2                     7397.6    12.13    3
OptPREO3                     7126.7    54.90   15
NewGVNO333                   7316.4    72.31    3
GVNO3                        7243.1    82.98    4
PessimisticO3                7166.1    56.02   15
PessimisticO2                7191.3    45.97    3
PessimisticNewGVNO2          7420.8    14.66    3
PessimisticNewGVNO3          7379.9    12.63    3
NewGVNO2-debug               5044.3     8.26    3
rebase                       5010.3    15.87    3

FFTW

Build: Float + SSE - Size: 2D FFT Size 512

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                       11870.0    97.11    3
NoGVNO2                     11803.0    84.01    3
NoGVNO3                     11750.0    19.97    3
OptNoSimplO2                11714.0    64.54    3
2 x Intel Xeon E5-2620 v2   11715.0    37.37    3
OptNoSimplO3                11778.0    61.50    3
OptSimplO2                  11780.0   128.25    3
OptSimplO3                  11858.0    36.67    3
OptRedO2                    11836.0    93.72    3
OptRedO3                    11709.0    39.68    3
OptPREO2                    11821.0    83.95    3
GVNO2                       11841.0    40.29    3
NewGVNO2                    11652.0    64.51    3
OptPREO3                    11893.0    57.89    3
NewGVNO333                  11803.0    41.82    3
GVNO3                       11818.0    52.21    3
PessimisticO3               11821.0    60.34    3
PessimisticO2               11806.0    74.64    3
PessimisticNewGVNO2         11693.0   106.71    3
PessimisticNewGVNO3         11838.0   127.52    3
NewGVNO2-debug               4134.8     1.80    3
rebase                       4126.2    16.98    3

FFTW

Build: Stock - Size: 1D FFT Size 4096

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                        3914.8    17.36    3
NoGVNO2                      3895.4     8.63    3
NoGVNO3                      3912.1    13.90    3
OptNoSimplO2                 3909.5     1.10    3
2 x Intel Xeon E5-2620 v2    3907.0     8.30    3
OptNoSimplO3                 3917.4    13.95    3
OptSimplO2                   3911.5     7.30    3
OptSimplO3                   3919.3    15.54    3
OptRedO2                     3923.9     4.75    3
OptRedO3                     3921.7     5.34    3
OptPREO2                     3931.3    11.64    3
GVNO2                        3936.0     9.96    3
NewGVNO2                     3924.2     4.58    3
OptPREO3                     3920.7     4.73    3
NewGVNO333                   3934.4     0.15    3
GVNO3                        3950.0     8.49    3
PessimisticO3                3934.6     5.80    3
PessimisticO2                3927.1    10.45    3
PessimisticNewGVNO2          3932.3    16.37    3
PessimisticNewGVNO3          3949.1     8.18    3
NewGVNO2-debug               3920.9     9.04    3
rebase                       3934.6     2.03    3

FFTW

Build: Float + SSE - Size: 2D FFT Size 64

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                       14982.0    26.91    3
NoGVNO2                     15878.0    70.08    3
NoGVNO3                     15952.0   133.36    3
OptNoSimplO2                13502.0    41.97    3
2 x Intel Xeon E5-2620 v2   13636.0    70.91    3
OptNoSimplO3                13740.0    55.77    3
OptSimplO2                  15782.0   136.22   15
OptSimplO3                  15981.0    82.53    3
OptRedO2                    15989.0    23.46    3
OptRedO3                    15845.0    86.00    3
OptPREO2                    15388.0   159.78   15
GVNO2                       16035.0    50.86    3
NewGVNO2                    15906.0   181.67    3
OptPREO3                    16141.0    39.26    3
NewGVNO333                  16110.0    39.26    3
GVNO3                       15842.0   125.58   15
PessimisticO3               15611.0   137.71   15
PessimisticO2               15994.0   107.04    3
PessimisticNewGVNO2         15787.0   148.65    7
PessimisticNewGVNO3         15873.0    49.59    3
NewGVNO2-debug               5134.1    10.34    3
rebase                       5137.8     7.89    3

FFTW

Build: Float + SSE - Size: 2D FFT Size 128

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                       11729.0    84.15   12
NoGVNO2                     11743.0    95.52    3
NoGVNO3                     11952.0    92.80    3
OptNoSimplO2                12188.0   113.85   15
2 x Intel Xeon E5-2620 v2   12295.0    82.96   15
OptNoSimplO3                12591.0    50.90    3
OptSimplO2                  11578.0   111.70    3
OptSimplO3                  11934.0    47.40    3
OptRedO2                    11609.0    48.56    3
OptRedO3                    11943.0   104.94    8
OptPREO2                    11788.0    94.72   15
GVNO2                       11739.0    95.45    3
NewGVNO2                    11796.0    91.43   10
OptPREO3                    11781.0   147.41    3
NewGVNO333                  11991.0    86.04    3
GVNO3                       11795.0   105.79    7
PessimisticO3               11820.0   101.06    3
PessimisticO2               11577.0   136.55    3
PessimisticNewGVNO2         11726.0    62.19    3
PessimisticNewGVNO3         11746.0   118.42    3
NewGVNO2-debug               4335.8     1.05    3
rebase                       4309.2    28.54    3

FFTW

Build: Stock - Size: 1D FFT Size 128

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                        4420.6     6.43    3
NoGVNO2                      4414.6     2.46    3
NoGVNO3                      4379.5    35.10    9
OptNoSimplO2                 4358.8     1.43    3
2 x Intel Xeon E5-2620 v2    4354.3     3.10    3
OptNoSimplO3                 4364.2     2.94    3
OptSimplO2                   4408.4     2.45    3
OptSimplO3                   4417.1     2.14    3
OptRedO2                     4415.5     4.33    3
OptRedO3                     4415.3     2.25    3
OptPREO2                     4402.8     4.02    3
GVNO2                        4433.2     1.52    3
NewGVNO2                     4405.0     5.65    3
OptPREO3                     4427.2     9.28    3
NewGVNO333                   4294.0    51.58   15
GVNO3                        4327.3    52.07   15
PessimisticO3                4434.7     1.58    3
PessimisticO2                4405.5     3.05    3
PessimisticNewGVNO2          4412.3     6.19    3
PessimisticNewGVNO3          4295.7    36.81   15
NewGVNO2-debug               4406.1     2.58    3
rebase                       4357.8    34.36    3

FFTW

Build: Stock - Size: 2D FFT Size 512

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                        3780.7    10.97    3
NoGVNO2                      3773.3    13.13    3
NoGVNO3                      3797.5    10.93    3
OptNoSimplO2                 3679.2    38.03    5
2 x Intel Xeon E5-2620 v2    3736.5    41.40    5
OptNoSimplO3                 3751.1    14.36    3
OptSimplO2                   3748.4    34.42   12
OptSimplO3                   3759.4     9.69    3
OptRedO2                     3780.6     6.77    3
OptRedO3                     3784.4     7.10    3
OptPREO2                     3775.9    16.55    3
GVNO2                        3767.9     5.58    3
NewGVNO2                     3769.9    20.08    3
OptPREO3                     3785.1     3.13    3
NewGVNO333                   3776.9    14.05    3
GVNO3                        3761.9     6.57    3
PessimisticO3                3780.8    21.49    3
PessimisticO2                3776.0    11.78    3
PessimisticNewGVNO2          3768.0    14.01    3
PessimisticNewGVNO3          3772.5     2.02    3
NewGVNO2-debug               3768.5     5.82    3
rebase                       3767.6     1.97    3

FFTW

Build: Float + SSE - Size: 1D FFT Size 2048

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                       15513.0    41.31    3
NoGVNO2                     15461.0    75.96    3
NoGVNO3                     15457.0    78.24    3
OptNoSimplO2                15543.0    24.95    3
2 x Intel Xeon E5-2620 v2   15539.0   107.49    3
OptNoSimplO3                15551.0    15.63    3
OptSimplO2                  15526.0    37.49    3
OptSimplO3                  15631.0    27.28    3
OptRedO2                    15568.0    58.95    3
OptRedO3                    15538.0    33.86    3
OptPREO2                    15407.0    53.18    3
GVNO2                       15428.0    20.00    3
NewGVNO2                    15564.0    42.13    3
OptPREO3                    15389.0    16.00    3
NewGVNO333                  15571.0    20.01    3
GVNO3                       15426.0    76.47    3
PessimisticO3               15481.0    97.36    3
PessimisticO2               15338.0   130.85    3
PessimisticNewGVNO2         15367.0   143.99    3
PessimisticNewGVNO3         15556.0    17.24    3
NewGVNO2-debug               4427.5    17.00    3
rebase                       4430.6    23.83    3

FFTW

Build: Float + SSE - Size: 2D FFT Size 256

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                       11414.0    86.71    3
NoGVNO2                     11399.0    85.30    3
NoGVNO3                     11511.0    40.95    3
OptNoSimplO2                11094.0   115.75    3
2 x Intel Xeon E5-2620 v2   11087.0   119.21    3
OptNoSimplO3                11137.0    21.39    3
OptSimplO2                  11440.0    35.14    3
OptSimplO3                  11378.0    91.42    3
OptRedO2                    11356.0   108.39    3
OptRedO3                    11337.0   140.84    3
OptPREO2                    11465.0    55.86    3
GVNO2                       11411.0    40.76    3
NewGVNO2                    11408.0    84.51    3
OptPREO3                    11482.0    60.95    3
NewGVNO333                  11548.0    65.80    3
GVNO3                       11467.0    34.04    3
PessimisticO3               11480.0    62.20    3
PessimisticO2               11402.0    73.06    3
PessimisticNewGVNO2         11393.0    68.55    3
PessimisticNewGVNO3         11434.0    62.20    3
NewGVNO2-debug               4121.4    11.50    3
rebase                       4126.7     7.91    3

FFTW

Build: Stock - Size: 1D FFT Size 2048

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                        4055.6     6.96    3
NoGVNO2                      4051.3     6.84    3
NoGVNO3                      4055.0     8.12    3
OptNoSimplO2                 4046.9    18.65    3
2 x Intel Xeon E5-2620 v2    4046.4    11.79    3
OptNoSimplO3                 4031.3     6.58    3
OptSimplO2                   4030.6    14.04    3
OptSimplO3                   4039.5     7.07    3
OptRedO2                     4048.3    14.17    3
OptRedO3                     4037.2    22.16    3
OptPREO2                     4045.5     4.44    3
GVNO2                        3989.4    47.88    4
NewGVNO2                     4051.5    12.48    3
OptPREO3                     4034.4    16.83    3
NewGVNO333                   4053.4     8.15    3
GVNO3                        4042.0    14.25    3
PessimisticO3                4063.8     9.60    3
PessimisticO2                4034.3     6.35    3
PessimisticNewGVNO2          4046.4     4.42    3
PessimisticNewGVNO3          4049.1    11.53    3
NewGVNO2-debug               4045.0     4.58    3
rebase                       4059.6     2.38    3

FFTW

Build: Stock - Size: 2D FFT Size 128

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                        3855.8     4.06    3
NoGVNO2                      3835.5     7.88    3
NoGVNO3                      3861.6     3.71    3
OptNoSimplO2                 3812.5    31.53    3
2 x Intel Xeon E5-2620 v2    3848.0     9.25    3
OptNoSimplO3                 3839.4    21.66    3
OptSimplO2                   3836.5    18.17    3
OptSimplO3                   3836.4     4.22    3
OptRedO2                     3831.9    14.94    3
OptRedO3                     3821.5    28.74    3
OptPREO2                     3855.9    11.59    3
GVNO2                        3836.2    21.71    3
NewGVNO2                     3834.9    17.74    3
OptPREO3                     3837.3    29.76    3
NewGVNO333                   3824.5    30.58    3
GVNO3                        3805.3     4.10    3
PessimisticO3                3812.3    29.21    3
PessimisticO2                3865.1     2.08    3
PessimisticNewGVNO2          3829.8    24.20    3
PessimisticNewGVNO3          3855.4     4.48    3
NewGVNO2-debug               3868.5     1.97    3
rebase                       3848.2    13.01    3

FFTW

Build: Float + SSE - Size: 1D FFT Size 64

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                        9870.6   172.96   15
NoGVNO2                     10350.0    99.10    3
NoGVNO3                     10470.0   114.62    3
OptNoSimplO2                10131.0    33.87    3
2 x Intel Xeon E5-2620 v2   10229.0    54.50    3
OptNoSimplO3                 9964.4    98.25    3
OptSimplO2                  10256.0   141.95    3
OptSimplO3                  10321.0    15.38    3
OptRedO2                    10354.0    18.19    3
OptRedO3                    10397.0    18.11    3
OptPREO2                    10357.8    69.65   13
GVNO2                        9996.3   144.31   15
NewGVNO2                    10418.0    85.65    3
OptPREO3                    10315.0     7.77    3
NewGVNO333                  10279.5   101.28    5
GVNO3                       10372.0    40.83    3
PessimisticO3               10410.0    10.59    3
PessimisticO2               10598.0    17.46    3
PessimisticNewGVNO2         10529.0    31.09    3
PessimisticNewGVNO3         10378.0    80.37   10
NewGVNO2-debug               5058.9    54.47    3
rebase                       5116.7    20.81    3

FFTW

Build: Float + SSE - Size: 1D FFT Size 1024

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                       16266.0    80.85    3
NoGVNO2                     16141.0    67.72    3
NoGVNO3                     16059.0    80.68    3
OptNoSimplO2                15635.0    43.66    3
2 x Intel Xeon E5-2620 v2   15517.0   112.78    3
OptNoSimplO3                15630.0    32.22    3
OptSimplO2                  15962.0    73.63    3
OptSimplO3                  16037.0   141.62    3
OptRedO2                    16099.0    85.56    3
OptRedO3                    16169.0    94.45    3
OptPREO2                    16090.0    82.71    3
GVNO2                       16167.0    78.73    3
NewGVNO2                    16234.0    75.24    3
OptPREO3                    16143.0    21.01    3
NewGVNO333                  15939.0   129.47    3
GVNO3                       16119.0    15.17    3
PessimisticO3               16223.0    78.81    3
PessimisticO2               15979.0   135.76    3
PessimisticNewGVNO2         16165.0    64.06    3
PessimisticNewGVNO3         16065.0    86.33    3
NewGVNO2-debug               4548.8     9.20    3
rebase                       4528.8    12.89    3

FFTW

Build: Stock - Size: 1D FFT Size 256

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                        4296.2     1.72    3
NoGVNO2                      4308.3     1.57    3
NoGVNO3                      4290.5    15.53    3
OptNoSimplO2                 4309.2     1.60    3
2 x Intel Xeon E5-2620 v2    4316.1    27.37    3
OptNoSimplO3                 4312.8    24.49    3
OptSimplO2                   4275.6     8.48    3
OptSimplO3                   4247.7    53.04    3
OptRedO2                     4288.1    22.07    3
OptRedO3                     4329.5    17.30    3
OptPREO2                     4314.3     3.18    3
GVNO2                        4300.9    24.27    3
NewGVNO2                     4308.3     5.17    3
OptPREO3                     4241.6    43.31    3
NewGVNO333                   4317.0    17.44    3
GVNO3                        4313.4    19.87    3
PessimisticO3                4289.2    10.82    3
PessimisticO2                4335.8    12.05    3
PessimisticNewGVNO2          4301.8     2.74    3
PessimisticNewGVNO3          4293.2     4.79    3
NewGVNO2-debug               4305.0     7.39    3
rebase                       4274.2     7.67    3

FFTW

Build: Stock - Size: 1D FFT Size 1024

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                        4285.6     5.54    3
NoGVNO2                      4282.9     2.87    3
NoGVNO3                      4270.0    19.30    3
OptNoSimplO2                 4233.7     8.01    3
2 x Intel Xeon E5-2620 v2    4178.4    59.35    3
OptNoSimplO3                 4198.1    25.83    3
OptSimplO2                   4276.0    11.99    3
OptSimplO3                   4286.3     4.50    3
OptRedO2                     4280.9     4.77    3
OptRedO3                     4252.9    32.58    3
OptPREO2                     4267.9    14.59    3
GVNO2                        4232.8    48.65    4
NewGVNO2                     4263.2     9.92    3
OptPREO3                     4288.4     4.55    3
NewGVNO333                   4286.7     7.01    3
GVNO3                        4290.6     8.12    3
PessimisticO3                4282.5    10.52    3
PessimisticO2                4265.5    26.82    3
PessimisticNewGVNO2          4271.7     6.86    3
PessimisticNewGVNO3          4276.9    28.59    3
NewGVNO2-debug               4248.9    21.17    3
rebase                       4271.5    26.52    3

FFTW

Build: Float + SSE - Size: 1D FFT Size 256

FFTW 3.3.6: Mflops, More Is Better

Configuration                Mflops   SE +/-    N
debug                       13679.0    98.34    3
NoGVNO2                     13989.0    30.75    3
NoGVNO3                     14032.0    28.45    3
OptNoSimplO2                13651.0    68.09    3
2 x Intel Xeon E5-2620 v2   13831.0    67.57    3
OptNoSimplO3                13795.0    58.29    3
OptSimplO2                  14455.0   115.02    3
OptSimplO3                  13983.0    36.46    3
OptRedO2                    14125.0   145.93    4
OptRedO3                    13997.0   165.24    3
OptPREO2                    14247.0   145.87    3
GVNO2                       14044.0   150.68    4
NewGVNO2                    14150.0    71.88    3
OptPREO3                    14266.0   127.87    3
NewGVNO333                  14009.0   146.24    4
GVNO3                       14124.0   169.36    3
PessimisticO3               13794.0   101.06    3
PessimisticO2               14365.0   169.66    3
PessimisticNewGVNO2         14328.0   137.11    3
PessimisticNewGVNO3         14091.0    95.21   13
NewGVNO2-debug               4442.7    15.23    3
rebase                       4525.9     3.31    3

FFTW

Build: Stock - Size: 2D FFT Size 256

FFTW 3.3.6, Build: Stock - Size: 2D FFT Size 256 (OpenBenchmarking.org)
Mflops, More Is Better
Legend: debug, NoGVNO2, NoGVNO3, OptNoSimplO2, 2 x Intel Xeon E5-2620 v2, OptNoSimplO3, OptSimplO2, OptSimplO3, OptRedO2, OptRedO3, OptPREO2, GVNO2, NewGVNO2, OptPREO3, NewGVNO3, 33, GVNO3, PessimisticO3, PessimisticO2, PessimisticNewGVNO2, PessimisticNewGVNO3, NewGVNO2-debug, rebase
Results in chart order:
3660.6 (SE +/- 23.34, N = 3)
3686.7 (SE +/- 13.35, N = 3)
3693.3 (SE +/- 7.11, N = 3)
3671.3 (SE +/- 17.09, N = 3)
3634.9 (SE +/- 23.89, N = 3)
3616.9 (SE +/- 37.88, N = 3)
3642.8 (SE +/- 34.93, N = 3)
3673.0 (SE +/- 34.82, N = 3)
3622.2 (SE +/- 16.18, N = 3)
3655.8 (SE +/- 33.79, N = 3)
3641.6 (SE +/- 12.43, N = 3)
3589.5 (SE +/- 47.63, N = 3)
3648.0 (SE +/- 33.28, N = 3)
3583.5 (SE +/- 35.49, N = 3)
3610.3 (SE +/- 23.72, N = 3)
3654.9 (SE +/- 12.33, N = 3)
3603.6 (SE +/- 50.06, N = 3)
3629.7 (SE +/- 20.14, N = 3)
3663.6 (SE +/- 4.70, N = 3)
3643.3 (SE +/- 11.87, N = 3)
3684.2 (SE +/- 1.36, N = 3)
3649.5 (SE +/- 33.75, N = 3)
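
The Mflops figures are a scaled inverse of run time rather than a count of operations actually executed. As a rough sketch, assuming these runs are complex transforms and that the benchmark follows the usual FFTW benchmark convention of Mflops = 5 N log2(N) / (time in microseconds), with N the total number of points, a reported value can be converted back to time per transform. The function names below are hypothetical.

```python
import math


def nominal_flops(*dims):
    """Nominal flop count 5 * N * log2(N) for a complex FFT with N = product of dims.

    This is the assumed benchmark convention, not a measured operation count.
    """
    n = math.prod(dims)
    return 5.0 * n * math.log2(n)


def time_per_transform_us(mflops, *dims):
    """Microseconds per transform implied by a reported Mflops value."""
    return nominal_flops(*dims) / mflops


# Stock build, 2D FFT Size 256, first reported result (3660.6 Mflops).
print(nominal_flops(256, 256))
print(round(time_per_transform_us(3660.6, 256, 256), 1), "microseconds")
```

Under this convention, results for different transform sizes are comparable as throughput figures even though the underlying run times differ by orders of magnitude.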

FFTW

Build: Float + SSE - Size: 1D FFT Size 512

FFTW 3.3.6, Build: Float + SSE - Size: 1D FFT Size 512 (OpenBenchmarking.org)
Mflops, More Is Better
Legend: debug, NoGVNO2, NoGVNO3, OptNoSimplO2, 2 x Intel Xeon E5-2620 v2, OptNoSimplO3, OptSimplO2, OptSimplO3, OptRedO2, OptRedO3, OptPREO2, GVNO2, NewGVNO2, OptPREO3, NewGVNO3, 33, GVNO3, PessimisticO3, PessimisticO2, PessimisticNewGVNO2, PessimisticNewGVNO3, NewGVNO2-debug, rebase
Results in chart order:
15552.0 (SE +/- 123.58, N = 3)
15560.0 (SE +/- 81.91, N = 3)
15471.0 (SE +/- 120.34, N = 3)
15289.0 (SE +/- 157.18, N = 3)
15270.0 (SE +/- 127.67, N = 3)
15400.0 (SE +/- 66.70, N = 3)
15414.0 (SE +/- 60.73, N = 3)
15574.0 (SE +/- 63.39, N = 3)
15294.0 (SE +/- 195.78, N = 3)
15404.0 (SE +/- 104.45, N = 3)
15538.0 (SE +/- 127.81, N = 3)
15704.0 (SE +/- 89.29, N = 3)
15612.0 (SE +/- 71.36, N = 3)
15210.0 (SE +/- 45.83, N = 3)
15545.0 (SE +/- 163.42, N = 3)
15576.0 (SE +/- 109.69, N = 3)
15525.0 (SE +/- 168.49, N = 3)
15403.0 (SE +/- 89.79, N = 3)
15380.0 (SE +/- 109.70, N = 3)
15508.0 (SE +/- 143.51, N = 3)
4454.7 (SE +/- 27.78, N = 3)
4503.6 (SE +/- 4.42, N = 3)

FFTW

Build: Stock - Size: 2D FFT Size 64

FFTW 3.3.6, Build: Stock - Size: 2D FFT Size 64 (OpenBenchmarking.org)
Mflops, More Is Better
Legend: debug, NoGVNO2, NoGVNO3, OptNoSimplO2, 2 x Intel Xeon E5-2620 v2, OptNoSimplO3, OptSimplO2, OptSimplO3, OptRedO2, OptRedO3, OptPREO2, GVNO2, NewGVNO2, OptPREO3, NewGVNO3, 33, GVNO3, PessimisticO3, PessimisticO2, PessimisticNewGVNO2, PessimisticNewGVNO3, NewGVNO2-debug, rebase
Results in chart order:
4309.2 (SE +/- 21.54, N = 3)
4336.7 (SE +/- 4.40, N = 3)
4289.8 (SE +/- 46.52, N = 4)
4337.1 (SE +/- 4.58, N = 3)
4336.6 (SE +/- 5.04, N = 3)
4354.7 (SE +/- 7.74, N = 3)
4318.6 (SE +/- 4.27, N = 3)
4326.3 (SE +/- 21.71, N = 3)
4320.8 (SE +/- 22.79, N = 3)
4337.3 (SE +/- 18.00, N = 3)
4286.4 (SE +/- 61.56, N = 3)
4206.7 (SE +/- 51.49, N = 4)
4316.4 (SE +/- 22.90, N = 3)
4321.4 (SE +/- 3.34, N = 3)
4265.1 (SE +/- 60.33, N = 3)
4347.4 (SE +/- 2.62, N = 3)
4280.9 (SE +/- 60.71, N = 3)
4291.0 (SE +/- 52.12, N = 4)
4260.5 (SE +/- 56.44, N = 3)
4287.6 (SE +/- 48.59, N = 4)
4216.3 (SE +/- 54.17, N = 3)
4336.3 (SE +/- 27.50, N = 3)

FFTW

Build: Float + SSE - Size: 1D FFT Size 128

FFTW 3.3.6, Build: Float + SSE - Size: 1D FFT Size 128 (OpenBenchmarking.org)
Mflops, More Is Better
Legend: debug, NoGVNO2, NoGVNO3, OptNoSimplO2, 2 x Intel Xeon E5-2620 v2, OptNoSimplO3, OptSimplO2, OptSimplO3, OptRedO2, OptRedO3, OptPREO2, GVNO2, NewGVNO2, OptPREO3, NewGVNO3, 33, GVNO3, PessimisticO3, PessimisticO2, PessimisticNewGVNO2, PessimisticNewGVNO3, NewGVNO2-debug, rebase
Results in chart order:
12702.0 (SE +/- 136.03, N = 15)
13026.0 (SE +/- 160.95, N = 3)
13206.0 (SE +/- 33.80, N = 3)
12789.0 (SE +/- 84.00, N = 3)
12740.0 (SE +/- 136.77, N = 3)
12850.0 (SE +/- 80.49, N = 3)
13245.0 (SE +/- 45.94, N = 3)
13058.0 (SE +/- 130.44, N = 3)
13072.0 (SE +/- 133.00, N = 3)
13000.0 (SE +/- 67.87, N = 3)
13167.0 (SE +/- 132.00, N = 5)
12831.0 (SE +/- 79.22, N = 3)
12908.0 (SE +/- 169.95, N = 3)
13143.0 (SE +/- 120.91, N = 3)
13219.0 (SE +/- 46.51, N = 3)
13146.0 (SE +/- 52.35, N = 3)
13019.0 (SE +/- 90.08, N = 3)
13095.0 (SE +/- 165.41, N = 3)
13031.0 (SE +/- 93.99, N = 3)
13238.0 (SE +/- 63.14, N = 3)
4526.9 (SE +/- 0.33, N = 3)
4530.3 (SE +/- 7.16, N = 3)

FFTW

Build: Stock - Size: 1D FFT Size 32

FFTW 3.3.6, Build: Stock - Size: 1D FFT Size 32 (OpenBenchmarking.org)
Mflops, More Is Better
Legend: debug, NoGVNO2, NoGVNO3, OptNoSimplO2, 2 x Intel Xeon E5-2620 v2, OptNoSimplO3, OptSimplO2, OptSimplO3, OptRedO2, OptRedO3, OptPREO2, GVNO2, NewGVNO2, OptPREO3, NewGVNO3, 33, GVNO3, PessimisticO3, PessimisticO2, PessimisticNewGVNO2, PessimisticNewGVNO3, NewGVNO2-debug, rebase
Results in chart order:
4998.5 (SE +/- 29.89, N = 3)
4967.1 (SE +/- 8.35, N = 3)
5020.8 (SE +/- 23.10, N = 3)
5008.9 (SE +/- 17.04, N = 3)
5024.6 (SE +/- 4.74, N = 3)
4898.0 (SE +/- 95.20, N = 13)
4962.7 (SE +/- 25.51, N = 3)
4963.8 (SE +/- 39.97, N = 3)
4957.9 (SE +/- 31.02, N = 3)
4990.1 (SE +/- 13.97, N = 3)
5000.0 (SE +/- 15.07, N = 3)
4997.4 (SE +/- 13.08, N = 3)
5005.7 (SE +/- 31.02, N = 3)
5008.8 (SE +/- 18.68, N = 3)
5021.3 (SE +/- 2.77, N = 3)
5019.8 (SE +/- 26.35, N = 3)
4979.9 (SE +/- 15.16, N = 3)
5004.9 (SE +/- 24.70, N = 3)
5027.4 (SE +/- 1.59, N = 3)
5020.0 (SE +/- 8.67, N = 3)
5009.9 (SE +/- 18.86, N = 3)
5021.8 (SE +/- 1.79, N = 3)

FFTW

Build: Stock - Size: 1D FFT Size 512

FFTW 3.3.6, Build: Stock - Size: 1D FFT Size 512 (OpenBenchmarking.org)
Mflops, More Is Better
Legend: debug, NoGVNO2, NoGVNO3, OptNoSimplO2, 2 x Intel Xeon E5-2620 v2, OptNoSimplO3, OptSimplO2, OptSimplO3, OptRedO2, OptRedO3, OptPREO2, GVNO2, NewGVNO2, OptPREO3, NewGVNO3, 33, GVNO3, PessimisticO3, PessimisticO2, PessimisticNewGVNO2, PessimisticNewGVNO3, NewGVNO2-debug, rebase
Results in chart order:
4320.4 (SE +/- 16.10, N = 3)
4344.4 (SE +/- 12.50, N = 3)
4345.7 (SE +/- 15.04, N = 3)
4340.1 (SE +/- 2.84, N = 3)
4285.8 (SE +/- 22.16, N = 3)
4314.4 (SE +/- 11.47, N = 3)
4334.2 (SE +/- 27.65, N = 3)
4346.8 (SE +/- 4.56, N = 3)
4340.0 (SE +/- 3.08, N = 3)
4367.8 (SE +/- 10.85, N = 3)
4366.8 (SE +/- 8.76, N = 3)
4298.4 (SE +/- 23.58, N = 3)
4334.2 (SE +/- 7.37, N = 3)
4338.5 (SE +/- 13.39, N = 3)
4332.9 (SE +/- 10.16, N = 3)
4352.1 (SE +/- 15.02, N = 3)
4350.4 (SE +/- 16.20, N = 3)
4354.2 (SE +/- 18.08, N = 3)
4319.9 (SE +/- 32.02, N = 3)
4332.4 (SE +/- 19.75, N = 3)
4278.3 (SE +/- 38.65, N = 3)
4318.7 (SE +/- 18.34, N = 3)

FFTW

Build: Float + SSE - Size: 2D FFT Size 32

FFTW 3.3.6, Build: Float + SSE - Size: 2D FFT Size 32 (OpenBenchmarking.org)
Mflops, More Is Better
Legend: debug, NoGVNO2, NoGVNO3, OptNoSimplO2, 2 x Intel Xeon E5-2620 v2, OptNoSimplO3, OptSimplO2, OptSimplO3, OptRedO2, OptRedO3, OptPREO2, GVNO2, NewGVNO2, OptPREO3, NewGVNO3, 33, GVNO3, PessimisticO3, PessimisticO2, PessimisticNewGVNO2, PessimisticNewGVNO3, NewGVNO2-debug, rebase
Results in chart order:
16440.0 (SE +/- 98.77, N = 3)
16548.0 (SE +/- 136.41, N = 3)
16526.0 (SE +/- 139.56, N = 3)
13471.0 (SE +/- 131.37, N = 3)
13394.0 (SE +/- 107.35, N = 3)
13414.0 (SE +/- 102.53, N = 3)
16444.0 (SE +/- 64.02, N = 3)
16373.0 (SE +/- 92.64, N = 3)
16668.0 (SE +/- 42.15, N = 3)
16519.0 (SE +/- 23.39, N = 3)
16344.0 (SE +/- 142.90, N = 3)
16640.0 (SE +/- 87.96, N = 3)
16344.0 (SE +/- 197.17, N = 3)
16348.0 (SE +/- 33.40, N = 3)
16554.0 (SE +/- 81.03, N = 3)
16311.0 (SE +/- 194.05, N = 3)
16487.0 (SE +/- 72.72, N = 3)
16448.0 (SE +/- 107.68, N = 3)
16548.0 (SE +/- 123.12, N = 3)
16378.0 (SE +/- 35.88, N = 3)
5238.9 (SE +/- 4.57, N = 3)
5227.5 (SE +/- 20.64, N = 3)

FFTW

Build: Stock - Size: 1D FFT Size 64

FFTW 3.3.6, Build: Stock - Size: 1D FFT Size 64 (OpenBenchmarking.org)
Mflops, More Is Better
Legend: debug, NoGVNO2, NoGVNO3, OptNoSimplO2, 2 x Intel Xeon E5-2620 v2, OptNoSimplO3, OptSimplO2, OptSimplO3, OptRedO2, OptRedO3, OptPREO2, GVNO2, NewGVNO2, OptPREO3, NewGVNO3, 33, GVNO3, PessimisticO3, PessimisticO2, PessimisticNewGVNO2, PessimisticNewGVNO3, NewGVNO2-debug, rebase
Results in chart order:
4906.8 (SE +/- 2.78, N = 3)
4907.7 (SE +/- 3.97, N = 3)
4897.9 (SE +/- 5.21, N = 3)
4846.9 (SE +/- 61.09, N = 3)
4899.5 (SE +/- 7.41, N = 3)
4899.9 (SE +/- 4.42, N = 3)
4911.4 (SE +/- 6.01, N = 3)
4905.7 (SE +/- 3.52, N = 3)
4892.0 (SE +/- 7.40, N = 3)
4899.1 (SE +/- 8.65, N = 3)
4902.9 (SE +/- 11.20, N = 3)
4885.0 (SE +/- 14.95, N = 3)
4894.2 (SE +/- 2.54, N = 3)
4893.8 (SE +/- 10.53, N = 3)
4891.0 (SE +/- 0.13, N = 3)
4899.6 (SE +/- 6.59, N = 3)
4914.1 (SE +/- 4.79, N = 3)
4893.4 (SE +/- 7.46, N = 3)
4899.6 (SE +/- 15.45, N = 3)
4884.4 (SE +/- 25.89, N = 3)
4897.4 (SE +/- 1.76, N = 3)
4900.8 (SE +/- 8.61, N = 3)

FFTW

Build: Stock - Size: 2D FFT Size 32

FFTW 3.3.6, Build: Stock - Size: 2D FFT Size 32 (OpenBenchmarking.org)
Mflops, More Is Better
Legend: debug, NoGVNO2, NoGVNO3, OptNoSimplO2, 2 x Intel Xeon E5-2620 v2, OptNoSimplO3, OptSimplO2, OptSimplO3, OptRedO2, OptRedO3, OptPREO2, GVNO2, NewGVNO2, OptPREO3, NewGVNO3, 33, GVNO3, PessimisticO3, PessimisticO2, PessimisticNewGVNO2, PessimisticNewGVNO3, NewGVNO2-debug, rebase
Results in chart order:
5218.7 (SE +/- 7.51, N = 3)
5211.5 (SE +/- 12.52, N = 3)
5219.0 (SE +/- 6.35, N = 3)
5127.9 (SE +/- 43.81, N = 3)
5115.0 (SE +/- 6.03, N = 3)
5144.6 (SE +/- 41.81, N = 3)
5217.9 (SE +/- 6.97, N = 3)
5213.4 (SE +/- 12.54, N = 3)
5189.3 (SE +/- 32.60, N = 3)
5220.5 (SE +/- 11.16, N = 3)
5144.8 (SE +/- 55.21, N = 3)
5179.1 (SE +/- 33.03, N = 3)
5213.2 (SE +/- 1.29, N = 3)
5209.0 (SE +/- 5.69, N = 3)
5180.4 (SE +/- 11.03, N = 3)
5176.4 (SE +/- 39.92, N = 3)
5176.1 (SE +/- 36.57, N = 3)
5201.1 (SE +/- 11.17, N = 3)
5173.5 (SE +/- 33.93, N = 3)
5208.7 (SE +/- 6.51, N = 3)
5196.9 (SE +/- 27.38, N = 3)
5220.4 (SE +/- 3.61, N = 3)

FFTW

Test Install Size

FFTW 3.3.6, Test Install Size (OpenBenchmarking.org)
Bytes, Fewer Is Better
NewGVNO2-debug: 116112
rebase: 116112


Phoronix Test Suite v10.8.5