fftw-1.2.0run

2 x Intel Xeon E5-2620 v2 testing with a ASUS Z9PE-D8 WS (5503 BIOS) and ASPEED on CentOS Stream 9 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2407171-NE-FFTW120RU32
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
debug
July 13
  1 Hour, 54 Minutes
NoGVNO2
July 14
  57 Minutes
NoGVNO3
July 15
  57 Minutes
OptNoSimplO2
July 16
  17 Minutes
2 x Intel Xeon E5-2620 v2
July 17
  17 Minutes
Invert Hiding All Results Option
  52 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


fftw-1.2.0runOpenBenchmarking.orgPhoronix Test Suite2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads)ASUS Z9PE-D8 WS (5503 BIOS)Intel Xeon E7 v2/Xeon32GB256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00PASPEEDRealtek ALC8982 x Intel 82574LCentOS Stream 95.14.0-467.el9.x86_64 (x86_64)X ServerNVIDIAGCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2ext41024x768ProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDisplay ServerDisplay DriverCompilerFile-SystemScreen ResolutionFftw-1.2.0run BenchmarksSystem Logs- Transparent Huge Pages: always- debug: CXXFLAGS=-O2 CFLAGS=-O2- NoGVNO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -disable-gvn=true" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -disable-gvn=true" - NoGVNO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -disable-gvn=true" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -disable-gvn=true" - OptNoSimplO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" - 2 x Intel Xeon E5-2620 v2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" - Optimized build with assertions; Built Apr 11 2013 (07:43:48); Default target: i386-pc-linux-gnu; Host CPU: i686 - Scaling Governor: intel_cpufreq conservative - CPU Microcode: 0x42e- gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Unknown: No mitigations + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; IBRS_FW; STIBP: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v2Result OverviewPhoronix Test Suite100%106%112%118%FFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFFTWFloat + SSE - 2D FFT Size 32Stock - 2D FFT Size 4096Float + SSE - 2D FFT Size 4096Float + SSE - 2D FFT Size 64Float + SSE - 2D FFT Size 1024Float + SSE - 2D FFT Size 2048Float + SSE - 1D FFT Size 32Float + SSE - 1D FFT Size 64Float + SSE - 1D FFT Size 1024Float + SSE - 2D FFT Size 128Float + SSE - 1D FFT Size 128Float + SSE - 2D FFT Size 256Stock - 2D FFT Size 2048Stock - 2D FFT Size 512Float + SSE - 1D FFT Size 256Stock - 1D FFT Size 1024Stock - 2D FFT Size 1024Stock - 2D FFT Size 32Float + SSE - 1D FFT Size 512Float + SSE - 1D FFT Size 4096Stock - 2D FFT Size 256Stock - 1D FFT Size 128Stock - 1D FFT Size 512Float + SSE - 2D FFT Size 512Stock - 2D FFT Size 128Stock - 1D FFT Size 64Stock - 1D FFT Size 32Stock - 2D FFT Size 64Stock - 1D FFT Size 256Float + SSE - 1D FFT Size 2048Stock - 1D FFT Size 4096Stock - 1D FFT Size 2048

fftw-1.2.0runfftw: Stock - 1D FFT Size 32fftw: Stock - 1D FFT Size 64fftw: Stock - 2D FFT Size 32fftw: Stock - 2D FFT Size 64fftw: Stock - 1D FFT Size 128fftw: Stock - 1D FFT Size 256fftw: Stock - 1D FFT Size 512fftw: Stock - 2D FFT Size 128fftw: Stock - 2D FFT Size 256fftw: Stock - 2D FFT Size 512fftw: Stock - 1D FFT Size 1024fftw: Stock - 1D FFT Size 2048fftw: Stock - 1D FFT Size 4096fftw: Stock - 2D FFT Size 1024fftw: Stock - 2D FFT Size 2048fftw: Stock - 2D FFT Size 4096fftw: Float + SSE - 1D FFT Size 32fftw: Float + SSE - 1D FFT Size 64fftw: Float + SSE - 2D FFT Size 32fftw: Float + SSE - 2D FFT Size 64fftw: Float + SSE - 1D FFT Size 128fftw: Float + SSE - 1D FFT Size 256fftw: Float + SSE - 1D FFT Size 512fftw: Float + SSE - 2D FFT Size 128fftw: Float + SSE - 2D FFT Size 256fftw: Float + SSE - 2D FFT Size 512fftw: Float + SSE - 1D FFT Size 1024fftw: Float + SSE - 1D FFT Size 2048fftw: Float + SSE - 1D FFT Size 4096fftw: Float + SSE - 2D FFT Size 1024fftw: Float + SSE - 2D FFT Size 2048fftw: Float + SSE - 2D FFT Size 4096debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v24998.54906.85218.74309.24420.64296.24320.43855.83660.63780.74285.64055.63914.83123.12706.62751.47287.09870.6164401498212702136791555211729114141187016266155131409110890.76459.16092.74967.14907.75211.54336.74414.64308.34344.43835.53686.73773.34282.94051.33895.43091.62723.72756.07311.4103501654815878130261398915560117431139911803161411546114186123167027.06717.95020.84897.95219.04289.84379.54290.54345.73861.63693.33797.54270.04055.03912.13107.22736.02738.97347.4104701652615952132061403215471119521151111750160591545714353122217077.76717.15008.94846.95127.94337.14358.84309.24340.13812.53671.33679.24233.74046.93909.53053.12662.02231.16805.0101311347113502127891365115289121881109411714156351554314243119036728.25603.75024.64899.55115.04336.64354.34316.14285.83848.03634.93736.54178.44046.43907.03048.52642.12239.36925.7102291339413636127401383115270122951108711715155171553914259119066665.95618.5OpenBenchmarking.org

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 322 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug11002200330044005500SE +/- 4.74, N = 3SE +/- 17.04, N = 3SE +/- 23.10, N = 3SE +/- 8.35, N = 3SE +/- 29.89, N = 35024.65008.95020.84967.14998.5

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 642 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug11002200330044005500SE +/- 7.41, N = 3SE +/- 61.09, N = 3SE +/- 5.21, N = 3SE +/- 3.97, N = 3SE +/- 2.78, N = 34899.54846.94897.94907.74906.8

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 322 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug11002200330044005500SE +/- 6.03, N = 3SE +/- 43.81, N = 3SE +/- 6.35, N = 3SE +/- 12.52, N = 3SE +/- 7.51, N = 35115.05127.95219.05211.55218.7

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 642 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug9001800270036004500SE +/- 5.04, N = 3SE +/- 4.58, N = 3SE +/- 46.52, N = 4SE +/- 4.40, N = 3SE +/- 21.54, N = 34336.64337.14289.84336.74309.2

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 1282 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug9001800270036004500SE +/- 3.10, N = 3SE +/- 1.43, N = 3SE +/- 35.10, N = 9SE +/- 2.46, N = 3SE +/- 6.43, N = 34354.34358.84379.54414.64420.6

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 2562 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug9001800270036004500SE +/- 27.37, N = 3SE +/- 1.60, N = 3SE +/- 15.53, N = 3SE +/- 1.57, N = 3SE +/- 1.72, N = 34316.14309.24290.54308.34296.2

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 5122 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug9001800270036004500SE +/- 22.16, N = 3SE +/- 2.84, N = 3SE +/- 15.04, N = 3SE +/- 12.50, N = 3SE +/- 16.10, N = 34285.84340.14345.74344.44320.4

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 1282 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug8001600240032004000SE +/- 9.25, N = 3SE +/- 31.53, N = 3SE +/- 3.71, N = 3SE +/- 7.88, N = 3SE +/- 4.06, N = 33848.03812.53861.63835.53855.8

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 2562 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug8001600240032004000SE +/- 23.89, N = 3SE +/- 17.09, N = 3SE +/- 7.11, N = 3SE +/- 13.35, N = 3SE +/- 23.34, N = 33634.93671.33693.33686.73660.6

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 5122 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug8001600240032004000SE +/- 41.40, N = 5SE +/- 38.03, N = 5SE +/- 10.93, N = 3SE +/- 13.13, N = 3SE +/- 10.97, N = 33736.53679.23797.53773.33780.7

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 10242 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug9001800270036004500SE +/- 59.35, N = 3SE +/- 8.01, N = 3SE +/- 19.30, N = 3SE +/- 2.87, N = 3SE +/- 5.54, N = 34178.44233.74270.04282.94285.6

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 20482 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug9001800270036004500SE +/- 11.79, N = 3SE +/- 18.65, N = 3SE +/- 8.12, N = 3SE +/- 6.84, N = 3SE +/- 6.96, N = 34046.44046.94055.04051.34055.6

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 40962 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug8001600240032004000SE +/- 8.30, N = 3SE +/- 1.10, N = 3SE +/- 13.90, N = 3SE +/- 8.63, N = 3SE +/- 17.36, N = 33907.03909.53912.13895.43914.8

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 10242 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug7001400210028003500SE +/- 13.57, N = 3SE +/- 12.20, N = 3SE +/- 38.36, N = 3SE +/- 13.41, N = 3SE +/- 15.08, N = 33048.53053.13107.23091.63123.1

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 20482 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug6001200180024003000SE +/- 5.49, N = 3SE +/- 6.15, N = 3SE +/- 12.56, N = 3SE +/- 8.21, N = 3SE +/- 3.24, N = 32642.12662.02736.02723.72706.6

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 40962 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug6001200180024003000SE +/- 8.09, N = 3SE +/- 16.21, N = 3SE +/- 16.56, N = 3SE +/- 14.23, N = 3SE +/- 5.74, N = 32239.32231.12738.92756.02751.4

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 322 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug16003200480064008000SE +/- 64.34, N = 15SE +/- 24.17, N = 3SE +/- 35.65, N = 3SE +/- 79.17, N = 3SE +/- 5.79, N = 36925.76805.07347.47311.47287.0

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 642 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug2K4K6K8K10KSE +/- 54.50, N = 3SE +/- 33.87, N = 3SE +/- 114.62, N = 3SE +/- 99.10, N = 3SE +/- 172.96, N = 1510229.010131.010470.010350.09870.6

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 322 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug4K8K12K16K20KSE +/- 107.35, N = 3SE +/- 131.37, N = 3SE +/- 139.56, N = 3SE +/- 136.41, N = 3SE +/- 98.77, N = 31339413471165261654816440

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 642 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug3K6K9K12K15KSE +/- 70.91, N = 3SE +/- 41.97, N = 3SE +/- 133.36, N = 3SE +/- 70.08, N = 3SE +/- 26.91, N = 31363613502159521587814982

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 1282 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug3K6K9K12K15KSE +/- 136.77, N = 3SE +/- 84.00, N = 3SE +/- 33.80, N = 3SE +/- 160.95, N = 3SE +/- 136.03, N = 151274012789132061302612702

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 2562 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug3K6K9K12K15KSE +/- 67.57, N = 3SE +/- 68.09, N = 3SE +/- 28.45, N = 3SE +/- 30.75, N = 3SE +/- 98.34, N = 31383113651140321398913679

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 5122 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug3K6K9K12K15KSE +/- 127.67, N = 3SE +/- 157.18, N = 3SE +/- 120.34, N = 3SE +/- 81.91, N = 3SE +/- 123.58, N = 31527015289154711556015552

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 1282 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug3K6K9K12K15KSE +/- 82.96, N = 15SE +/- 113.85, N = 15SE +/- 92.80, N = 3SE +/- 95.52, N = 3SE +/- 84.15, N = 121229512188119521174311729

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 2562 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug2K4K6K8K10KSE +/- 119.21, N = 3SE +/- 115.75, N = 3SE +/- 40.95, N = 3SE +/- 85.30, N = 3SE +/- 86.71, N = 31108711094115111139911414

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 5122 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug3K6K9K12K15KSE +/- 37.37, N = 3SE +/- 64.54, N = 3SE +/- 19.97, N = 3SE +/- 84.01, N = 3SE +/- 97.11, N = 31171511714117501180311870

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 10242 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug3K6K9K12K15KSE +/- 112.78, N = 3SE +/- 43.66, N = 3SE +/- 80.68, N = 3SE +/- 67.72, N = 3SE +/- 80.85, N = 31551715635160591614116266

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 20482 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug3K6K9K12K15KSE +/- 107.49, N = 3SE +/- 24.95, N = 3SE +/- 78.24, N = 3SE +/- 75.96, N = 3SE +/- 41.31, N = 31553915543154571546115513

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 40962 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug3K6K9K12K15KSE +/- 82.03, N = 3SE +/- 19.46, N = 3SE +/- 135.86, N = 3SE +/- 80.21, N = 3SE +/- 193.73, N = 31425914243143531418614091

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 10242 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug3K6K9K12K15KSE +/- 40.01, N = 3SE +/- 124.86, N = 3SE +/- 76.28, N = 3SE +/- 81.04, N = 3SE +/- 644.68, N = 1211906.011903.012221.012316.010890.7

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 20482 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug15003000450060007500SE +/- 41.03, N = 3SE +/- 33.67, N = 3SE +/- 8.17, N = 3SE +/- 11.97, N = 3SE +/- 44.49, N = 36665.96728.27077.77027.06459.1

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 40962 x Intel Xeon E5-2620 v2OptNoSimplO2NoGVNO3NoGVNO2debug14002800420056007000SE +/- 8.82, N = 3SE +/- 4.28, N = 3SE +/- 21.46, N = 3SE +/- 13.11, N = 3SE +/- 262.37, N = 95618.55603.76717.16717.96092.7