fftw-1.2.0run

2 x Intel Xeon E5-2620 v2 testing with a ASUS Z9PE-D8 WS (5503 BIOS) and ASPEED on CentOS Stream 9 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2407171-NE-FFTW120RU32
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
debug
July 13
  1 Hour, 54 Minutes
NoGVNO2
July 14
  57 Minutes
NoGVNO3
July 15
  57 Minutes
OptNoSimplO2
July 16
  17 Minutes
2 x Intel Xeon E5-2620 v2
July 17
  17 Minutes
Invert Hiding All Results Option
  52 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


fftw-1.2.0runOpenBenchmarking.orgPhoronix Test Suite2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads)ASUS Z9PE-D8 WS (5503 BIOS)Intel Xeon E7 v2/Xeon32GB256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00PASPEEDRealtek ALC8982 x Intel 82574LCentOS Stream 95.14.0-467.el9.x86_64 (x86_64)X ServerNVIDIAGCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2ext41024x768ProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDisplay ServerDisplay DriverCompilerFile-SystemScreen ResolutionFftw-1.2.0run BenchmarksSystem Logs- Transparent Huge Pages: always- debug: CXXFLAGS=-O2 CFLAGS=-O2- NoGVNO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -disable-gvn=true" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -disable-gvn=true" - NoGVNO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -disable-gvn=true" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -disable-gvn=true" - OptNoSimplO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" - 2 x Intel Xeon E5-2620 v2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" - Optimized build with assertions; Built Apr 11 2013 (07:43:48); Default target: i386-pc-linux-gnu; Host CPU: i686 - Scaling Governor: intel_cpufreq conservative - CPU Microcode: 0x42e- gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Unknown: No mitigations + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; IBRS_FW; STIBP: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v2Result OverviewPhoronix Test Suite100%106%112%118%FFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFloat + SSE - 2D FFT Size 32Stock - 2D FFT Size 4096Float + SSE - 2D FFT Size 4096Float + SSE - 2D FFT Size 64Float + SSE - 2D FFT Size 1024Float + SSE - 2D FFT Size 2048Float + SSE - 1D FFT Size 32Float + SSE - 1D FFT Size 64Float + SSE - 1D FFT Size 1024Float + SSE - 2D FFT Size 128Float + SSE - 1D FFT Size 128Float + SSE - 2D FFT Size 256Stock - 2D FFT Size 2048Stock - 2D FFT Size 512Float + SSE - 1D FFT Size 256Stock - 1D FFT Size 1024Stock - 2D FFT Size 1024Stock - 2D FFT Size 32Float + SSE - 1D FFT Size 512Float + SSE - 1D FFT Size 4096Stock - 2D FFT Size 256Stock - 1D FFT Size 128Stock - 1D FFT Size 512Float + SSE - 2D FFT Size 512Stock - 2D FFT Size 128Stock - 1D FFT Size 64Stock - 1D FFT Size 32Stock - 2D FFT Size 64Stock - 1D FFT Size 256Float + SSE - 1D FFT Size 2048Stock - 1D FFT Size 4096Stock - 1D FFT Size 2048

fftw-1.2.0runfftw: Stock - 1D FFT Size 32fftw: Stock - 1D FFT Size 64fftw: Stock - 2D FFT Size 32fftw: Stock - 2D FFT Size 64fftw: Stock - 1D FFT Size 128fftw: Stock - 1D FFT Size 256fftw: Stock - 1D FFT Size 512fftw: Stock - 2D FFT Size 128fftw: Stock - 2D FFT Size 256fftw: Stock - 2D FFT Size 512fftw: Stock - 1D FFT Size 1024fftw: Stock - 1D FFT Size 2048fftw: Stock - 1D FFT Size 4096fftw: Stock - 2D FFT Size 1024fftw: Stock - 2D FFT Size 2048fftw: Stock - 2D FFT Size 4096fftw: Float + SSE - 1D FFT Size 32fftw: Float + SSE - 1D FFT Size 64fftw: Float + SSE - 2D FFT Size 32fftw: Float + SSE - 2D FFT Size 64fftw: Float + SSE - 1D FFT Size 128fftw: Float + SSE - 1D FFT Size 256fftw: Float + SSE - 1D FFT Size 512fftw: Float + SSE - 2D FFT Size 128fftw: Float + SSE - 2D FFT Size 256fftw: Float + SSE - 2D FFT Size 512fftw: Float + SSE - 1D FFT Size 1024fftw: Float + SSE - 1D FFT Size 2048fftw: Float + SSE - 1D FFT Size 4096fftw: Float + SSE - 2D FFT Size 1024fftw: Float + SSE - 2D FFT Size 2048fftw: Float + SSE - 2D FFT Size 4096debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v24998.54906.85218.74309.24420.64296.24320.43855.83660.63780.74285.64055.63914.83123.12706.62751.47287.09870.6164401498212702136791555211729114141187016266155131409110890.76459.16092.74967.14907.75211.54336.74414.64308.34344.43835.53686.73773.34282.94051.33895.43091.62723.72756.07311.4103501654815878130261398915560117431139911803161411546114186123167027.06717.95020.84897.95219.04289.84379.54290.54345.73861.63693.33797.54270.04055.03912.13107.22736.02738.97347.4104701652615952132061403215471119521151111750160591545714353122217077.76717.15008.94846.95127.94337.14358.84309.24340.13812.53671.33679.24233.74046.93909.53053.12662.02231.16805.0101311347113502127891365115289121881109411714156351554314243119036728.25603.75024.64899.55115.04336.64354.34316.14285.83848.03634.93736.54178.44046.43907.03048.52642.12239.36925.7102291339413636127401383115270122951108711715155171553914259119066665.95618.5OpenBenchmarking.org

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 32debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v211002200330044005500SE +/- 29.89, N = 3SE +/- 8.35, N = 3SE +/- 23.10, N = 3SE +/- 17.04, N = 3SE +/- 4.74, N = 34998.54967.15020.85008.95024.6

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 64debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v211002200330044005500SE +/- 2.78, N = 3SE +/- 3.97, N = 3SE +/- 5.21, N = 3SE +/- 61.09, N = 3SE +/- 7.41, N = 34906.84907.74897.94846.94899.5

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 32debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v211002200330044005500SE +/- 7.51, N = 3SE +/- 12.52, N = 3SE +/- 6.35, N = 3SE +/- 43.81, N = 3SE +/- 6.03, N = 35218.75211.55219.05127.95115.0

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 64debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v29001800270036004500SE +/- 21.54, N = 3SE +/- 4.40, N = 3SE +/- 46.52, N = 4SE +/- 4.58, N = 3SE +/- 5.04, N = 34309.24336.74289.84337.14336.6

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 128debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v29001800270036004500SE +/- 6.43, N = 3SE +/- 2.46, N = 3SE +/- 35.10, N = 9SE +/- 1.43, N = 3SE +/- 3.10, N = 34420.64414.64379.54358.84354.3

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 256debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v29001800270036004500SE +/- 1.72, N = 3SE +/- 1.57, N = 3SE +/- 15.53, N = 3SE +/- 1.60, N = 3SE +/- 27.37, N = 34296.24308.34290.54309.24316.1

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 512debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v29001800270036004500SE +/- 16.10, N = 3SE +/- 12.50, N = 3SE +/- 15.04, N = 3SE +/- 2.84, N = 3SE +/- 22.16, N = 34320.44344.44345.74340.14285.8

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 128debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v28001600240032004000SE +/- 4.06, N = 3SE +/- 7.88, N = 3SE +/- 3.71, N = 3SE +/- 31.53, N = 3SE +/- 9.25, N = 33855.83835.53861.63812.53848.0

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 256debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v28001600240032004000SE +/- 23.34, N = 3SE +/- 13.35, N = 3SE +/- 7.11, N = 3SE +/- 17.09, N = 3SE +/- 23.89, N = 33660.63686.73693.33671.33634.9

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 512debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v28001600240032004000SE +/- 10.97, N = 3SE +/- 13.13, N = 3SE +/- 10.93, N = 3SE +/- 38.03, N = 5SE +/- 41.40, N = 53780.73773.33797.53679.23736.5

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 1024debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v29001800270036004500SE +/- 5.54, N = 3SE +/- 2.87, N = 3SE +/- 19.30, N = 3SE +/- 8.01, N = 3SE +/- 59.35, N = 34285.64282.94270.04233.74178.4

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 2048debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v29001800270036004500SE +/- 6.96, N = 3SE +/- 6.84, N = 3SE +/- 8.12, N = 3SE +/- 18.65, N = 3SE +/- 11.79, N = 34055.64051.34055.04046.94046.4

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 4096debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v28001600240032004000SE +/- 17.36, N = 3SE +/- 8.63, N = 3SE +/- 13.90, N = 3SE +/- 1.10, N = 3SE +/- 8.30, N = 33914.83895.43912.13909.53907.0

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 1024debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v27001400210028003500SE +/- 15.08, N = 3SE +/- 13.41, N = 3SE +/- 38.36, N = 3SE +/- 12.20, N = 3SE +/- 13.57, N = 33123.13091.63107.23053.13048.5

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 2048debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v26001200180024003000SE +/- 3.24, N = 3SE +/- 8.21, N = 3SE +/- 12.56, N = 3SE +/- 6.15, N = 3SE +/- 5.49, N = 32706.62723.72736.02662.02642.1

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v26001200180024003000SE +/- 5.74, N = 3SE +/- 14.23, N = 3SE +/- 16.56, N = 3SE +/- 16.21, N = 3SE +/- 8.09, N = 32751.42756.02738.92231.12239.3

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 32debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v216003200480064008000SE +/- 5.79, N = 3SE +/- 79.17, N = 3SE +/- 35.65, N = 3SE +/- 24.17, N = 3SE +/- 64.34, N = 157287.07311.47347.46805.06925.7

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 64debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v22K4K6K8K10KSE +/- 172.96, N = 15SE +/- 99.10, N = 3SE +/- 114.62, N = 3SE +/- 33.87, N = 3SE +/- 54.50, N = 39870.610350.010470.010131.010229.0

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 32debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v24K8K12K16K20KSE +/- 98.77, N = 3SE +/- 136.41, N = 3SE +/- 139.56, N = 3SE +/- 131.37, N = 3SE +/- 107.35, N = 31644016548165261347113394

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 64debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v23K6K9K12K15KSE +/- 26.91, N = 3SE +/- 70.08, N = 3SE +/- 133.36, N = 3SE +/- 41.97, N = 3SE +/- 70.91, N = 31498215878159521350213636

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 128debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v23K6K9K12K15KSE +/- 136.03, N = 15SE +/- 160.95, N = 3SE +/- 33.80, N = 3SE +/- 84.00, N = 3SE +/- 136.77, N = 31270213026132061278912740

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 256debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v23K6K9K12K15KSE +/- 98.34, N = 3SE +/- 30.75, N = 3SE +/- 28.45, N = 3SE +/- 68.09, N = 3SE +/- 67.57, N = 31367913989140321365113831

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 512debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v23K6K9K12K15KSE +/- 123.58, N = 3SE +/- 81.91, N = 3SE +/- 120.34, N = 3SE +/- 157.18, N = 3SE +/- 127.67, N = 31555215560154711528915270

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 128debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v23K6K9K12K15KSE +/- 84.15, N = 12SE +/- 95.52, N = 3SE +/- 92.80, N = 3SE +/- 113.85, N = 15SE +/- 82.96, N = 151172911743119521218812295

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 256debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v22K4K6K8K10KSE +/- 86.71, N = 3SE +/- 85.30, N = 3SE +/- 40.95, N = 3SE +/- 115.75, N = 3SE +/- 119.21, N = 31141411399115111109411087

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 512debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v23K6K9K12K15KSE +/- 97.11, N = 3SE +/- 84.01, N = 3SE +/- 19.97, N = 3SE +/- 64.54, N = 3SE +/- 37.37, N = 31187011803117501171411715

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 1024debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v23K6K9K12K15KSE +/- 80.85, N = 3SE +/- 67.72, N = 3SE +/- 80.68, N = 3SE +/- 43.66, N = 3SE +/- 112.78, N = 31626616141160591563515517

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 2048debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v23K6K9K12K15KSE +/- 41.31, N = 3SE +/- 75.96, N = 3SE +/- 78.24, N = 3SE +/- 24.95, N = 3SE +/- 107.49, N = 31551315461154571554315539

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 4096debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v23K6K9K12K15KSE +/- 193.73, N = 3SE +/- 80.21, N = 3SE +/- 135.86, N = 3SE +/- 19.46, N = 3SE +/- 82.03, N = 31409114186143531424314259

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 1024debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v23K6K9K12K15KSE +/- 644.68, N = 12SE +/- 81.04, N = 3SE +/- 76.28, N = 3SE +/- 124.86, N = 3SE +/- 40.01, N = 310890.712316.012221.011903.011906.0

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 2048debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v215003000450060007500SE +/- 44.49, N = 3SE +/- 11.97, N = 3SE +/- 8.17, N = 3SE +/- 33.67, N = 3SE +/- 41.03, N = 36459.17027.07077.76728.26665.9

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v214002800420056007000SE +/- 262.37, N = 9SE +/- 13.11, N = 3SE +/- 21.46, N = 3SE +/- 4.28, N = 3SE +/- 8.82, N = 36092.76717.96717.15603.75618.5