fftw-1.2.0run

2 x Intel Xeon E5-2620 v2 testing with a ASUS Z9PE-D8 WS (5503 BIOS) and ASPEED on CentOS Stream 9 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2407174-NE-FFTW120RU04&sro&grs.

fftw-1.2.0runProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDisplay ServerDisplay DriverCompilerFile-SystemScreen ResolutiondebugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v22 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads)ASUS Z9PE-D8 WS (5503 BIOS)Intel Xeon E7 v2/Xeon32GB256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00PASPEEDRealtek ALC8982 x Intel 82574LCentOS Stream 95.14.0-467.el9.x86_64 (x86_64)X ServerNVIDIAGCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2ext41024x768OpenBenchmarking.orgKernel Details- Transparent Huge Pages: alwaysEnvironment Details- debug: CXXFLAGS=-O2 CFLAGS=-O2- NoGVNO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -disable-gvn=true" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -disable-gvn=true"- NoGVNO3: CXXFLAGS="-O3 -mllvm -enable-newgvn -mllvm -disable-gvn=true" CFLAGS="-O3 -mllvm -enable-newgvn -mllvm -disable-gvn=true"- OptNoSimplO2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" - 2 x Intel Xeon E5-2620 v2: CXXFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" CFLAGS="-O2 -mllvm -enable-newgvn -mllvm -enable-phi-of-ops=false -mllvm -enable-newgvn-pre=false -mllvm -enable-newgvn-simpl=false" Compiler Details- Optimized build with assertions; Built Apr 11 2013 (07:43:48); Default target: i386-pc-linux-gnu; Host CPU: i686Processor Details- Scaling Governor: intel_cpufreq conservative - CPU Microcode: 0x42eSecurity Details- gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Unknown: No mitigations + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; IBRS_FW; STIBP: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

fftw-1.2.0runfftw: Float + SSE - 2D FFT Size 32fftw: Stock - 2D FFT Size 4096fftw: Float + SSE - 2D FFT Size 64fftw: Float + SSE - 2D FFT Size 2048fftw: Float + SSE - 1D FFT Size 32fftw: Float + SSE - 1D FFT Size 1024fftw: Float + SSE - 2D FFT Size 128fftw: Float + SSE - 1D FFT Size 128fftw: Float + SSE - 2D FFT Size 256fftw: Stock - 2D FFT Size 2048fftw: Stock - 2D FFT Size 512fftw: Float + SSE - 1D FFT Size 256fftw: Stock - 1D FFT Size 1024fftw: Stock - 2D FFT Size 1024fftw: Stock - 2D FFT Size 32fftw: Float + SSE - 1D FFT Size 512fftw: Float + SSE - 1D FFT Size 4096fftw: Stock - 2D FFT Size 256fftw: Stock - 1D FFT Size 128fftw: Stock - 1D FFT Size 512fftw: Float + SSE - 2D FFT Size 512fftw: Stock - 2D FFT Size 128fftw: Stock - 1D FFT Size 64fftw: Stock - 1D FFT Size 32fftw: Stock - 2D FFT Size 64fftw: Stock - 1D FFT Size 256fftw: Float + SSE - 1D FFT Size 2048fftw: Stock - 1D FFT Size 4096fftw: Stock - 1D FFT Size 2048fftw: Float + SSE - 2D FFT Size 4096fftw: Float + SSE - 2D FFT Size 1024fftw: Float + SSE - 1D FFT Size 64debugNoGVNO2NoGVNO3OptNoSimplO22 x Intel Xeon E5-2620 v2164402751.4149826459.17287.0162661172912702114142706.63780.7136794285.63123.15218.715552140913660.64420.64320.4118703855.84906.84998.54309.24296.2155133914.84055.66092.710890.79870.6165482756.0158787027.07311.4161411174313026113992723.73773.3139894282.93091.65211.515560141863686.74414.64344.4118033835.54907.74967.14336.74308.3154613895.44051.36717.91231610350165262738.9159527077.77347.4160591195213206115112736.03797.5140324270.03107.25219.015471143533693.34379.54345.7117503861.64897.95020.84289.84290.5154573912.14055.06717.11222110470134712231.1135026728.26805.0156351218812789110942662.03679.2136514233.73053.15127.915289142433671.34358.84340.1117143812.54846.95008.94337.14309.2155433909.54046.95603.71190310131133942239.3136366665.96925.7155171229512740110872642.13736.5138314178.43048.55115.015270142593634.94354.34285.8117153848.04899.55024.64336.64316.1155393907.04046.45618.51190610229OpenBenchmarking.org

FFTW

Build: Float + SSE - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 322 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug4K8K12K16K20KSE +/- 107.35, N = 3SE +/- 136.41, N = 3SE +/- 139.56, N = 3SE +/- 131.37, N = 3SE +/- 98.77, N = 31339416548165261347116440

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 40962 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug6001200180024003000SE +/- 8.09, N = 3SE +/- 14.23, N = 3SE +/- 16.56, N = 3SE +/- 16.21, N = 3SE +/- 5.74, N = 32239.32756.02738.92231.12751.4

FFTW

Build: Float + SSE - Size: 2D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 642 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug3K6K9K12K15KSE +/- 70.91, N = 3SE +/- 70.08, N = 3SE +/- 133.36, N = 3SE +/- 41.97, N = 3SE +/- 26.91, N = 31363615878159521350214982

FFTW

Build: Float + SSE - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 20482 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug15003000450060007500SE +/- 41.03, N = 3SE +/- 11.97, N = 3SE +/- 8.17, N = 3SE +/- 33.67, N = 3SE +/- 44.49, N = 36665.97027.07077.76728.26459.1

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 322 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug16003200480064008000SE +/- 64.34, N = 15SE +/- 79.17, N = 3SE +/- 35.65, N = 3SE +/- 24.17, N = 3SE +/- 5.79, N = 36925.77311.47347.46805.07287.0

FFTW

Build: Float + SSE - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 10242 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug3K6K9K12K15KSE +/- 112.78, N = 3SE +/- 67.72, N = 3SE +/- 80.68, N = 3SE +/- 43.66, N = 3SE +/- 80.85, N = 31551716141160591563516266

FFTW

Build: Float + SSE - Size: 2D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 1282 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug3K6K9K12K15KSE +/- 82.96, N = 15SE +/- 95.52, N = 3SE +/- 92.80, N = 3SE +/- 113.85, N = 15SE +/- 84.15, N = 121229511743119521218811729

FFTW

Build: Float + SSE - Size: 1D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 1282 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug3K6K9K12K15KSE +/- 136.77, N = 3SE +/- 160.95, N = 3SE +/- 33.80, N = 3SE +/- 84.00, N = 3SE +/- 136.03, N = 151274013026132061278912702

FFTW

Build: Float + SSE - Size: 2D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 2562 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug2K4K6K8K10KSE +/- 119.21, N = 3SE +/- 85.30, N = 3SE +/- 40.95, N = 3SE +/- 115.75, N = 3SE +/- 86.71, N = 31108711399115111109411414

FFTW

Build: Stock - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 20482 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug6001200180024003000SE +/- 5.49, N = 3SE +/- 8.21, N = 3SE +/- 12.56, N = 3SE +/- 6.15, N = 3SE +/- 3.24, N = 32642.12723.72736.02662.02706.6

FFTW

Build: Stock - Size: 2D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 5122 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug8001600240032004000SE +/- 41.40, N = 5SE +/- 13.13, N = 3SE +/- 10.93, N = 3SE +/- 38.03, N = 5SE +/- 10.97, N = 33736.53773.33797.53679.23780.7

FFTW

Build: Float + SSE - Size: 1D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 2562 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug3K6K9K12K15KSE +/- 67.57, N = 3SE +/- 30.75, N = 3SE +/- 28.45, N = 3SE +/- 68.09, N = 3SE +/- 98.34, N = 31383113989140321365113679

FFTW

Build: Stock - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 10242 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug9001800270036004500SE +/- 59.35, N = 3SE +/- 2.87, N = 3SE +/- 19.30, N = 3SE +/- 8.01, N = 3SE +/- 5.54, N = 34178.44282.94270.04233.74285.6

FFTW

Build: Stock - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 10242 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug7001400210028003500SE +/- 13.57, N = 3SE +/- 13.41, N = 3SE +/- 38.36, N = 3SE +/- 12.20, N = 3SE +/- 15.08, N = 33048.53091.63107.23053.13123.1

FFTW

Build: Stock - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 322 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug11002200330044005500SE +/- 6.03, N = 3SE +/- 12.52, N = 3SE +/- 6.35, N = 3SE +/- 43.81, N = 3SE +/- 7.51, N = 35115.05211.55219.05127.95218.7

FFTW

Build: Float + SSE - Size: 1D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 5122 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug3K6K9K12K15KSE +/- 127.67, N = 3SE +/- 81.91, N = 3SE +/- 120.34, N = 3SE +/- 157.18, N = 3SE +/- 123.58, N = 31527015560154711528915552

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 40962 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug3K6K9K12K15KSE +/- 82.03, N = 3SE +/- 80.21, N = 3SE +/- 135.86, N = 3SE +/- 19.46, N = 3SE +/- 193.73, N = 31425914186143531424314091

FFTW

Build: Stock - Size: 2D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 2562 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug8001600240032004000SE +/- 23.89, N = 3SE +/- 13.35, N = 3SE +/- 7.11, N = 3SE +/- 17.09, N = 3SE +/- 23.34, N = 33634.93686.73693.33671.33660.6

FFTW

Build: Stock - Size: 1D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 1282 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug9001800270036004500SE +/- 3.10, N = 3SE +/- 2.46, N = 3SE +/- 35.10, N = 9SE +/- 1.43, N = 3SE +/- 6.43, N = 34354.34414.64379.54358.84420.6

FFTW

Build: Stock - Size: 1D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 5122 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug9001800270036004500SE +/- 22.16, N = 3SE +/- 12.50, N = 3SE +/- 15.04, N = 3SE +/- 2.84, N = 3SE +/- 16.10, N = 34285.84344.44345.74340.14320.4

FFTW

Build: Float + SSE - Size: 2D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 5122 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug3K6K9K12K15KSE +/- 37.37, N = 3SE +/- 84.01, N = 3SE +/- 19.97, N = 3SE +/- 64.54, N = 3SE +/- 97.11, N = 31171511803117501171411870

FFTW

Build: Stock - Size: 2D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 1282 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug8001600240032004000SE +/- 9.25, N = 3SE +/- 7.88, N = 3SE +/- 3.71, N = 3SE +/- 31.53, N = 3SE +/- 4.06, N = 33848.03835.53861.63812.53855.8

FFTW

Build: Stock - Size: 1D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 642 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug11002200330044005500SE +/- 7.41, N = 3SE +/- 3.97, N = 3SE +/- 5.21, N = 3SE +/- 61.09, N = 3SE +/- 2.78, N = 34899.54907.74897.94846.94906.8

FFTW

Build: Stock - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 322 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug11002200330044005500SE +/- 4.74, N = 3SE +/- 8.35, N = 3SE +/- 23.10, N = 3SE +/- 17.04, N = 3SE +/- 29.89, N = 35024.64967.15020.85008.94998.5

FFTW

Build: Stock - Size: 2D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 642 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug9001800270036004500SE +/- 5.04, N = 3SE +/- 4.40, N = 3SE +/- 46.52, N = 4SE +/- 4.58, N = 3SE +/- 21.54, N = 34336.64336.74289.84337.14309.2

FFTW

Build: Stock - Size: 1D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 2562 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug9001800270036004500SE +/- 27.37, N = 3SE +/- 1.57, N = 3SE +/- 15.53, N = 3SE +/- 1.60, N = 3SE +/- 1.72, N = 34316.14308.34290.54309.24296.2

FFTW

Build: Float + SSE - Size: 1D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 20482 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug3K6K9K12K15KSE +/- 107.49, N = 3SE +/- 75.96, N = 3SE +/- 78.24, N = 3SE +/- 24.95, N = 3SE +/- 41.31, N = 31553915461154571554315513

FFTW

Build: Stock - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 40962 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug8001600240032004000SE +/- 8.30, N = 3SE +/- 8.63, N = 3SE +/- 13.90, N = 3SE +/- 1.10, N = 3SE +/- 17.36, N = 33907.03895.43912.13909.53914.8

FFTW

Build: Stock - Size: 1D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 20482 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug9001800270036004500SE +/- 11.79, N = 3SE +/- 6.84, N = 3SE +/- 8.12, N = 3SE +/- 18.65, N = 3SE +/- 6.96, N = 34046.44051.34055.04046.94055.6

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 40962 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug14002800420056007000SE +/- 8.82, N = 3SE +/- 13.11, N = 3SE +/- 21.46, N = 3SE +/- 4.28, N = 3SE +/- 262.37, N = 95618.56717.96717.15603.76092.7

FFTW

Build: Float + SSE - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 10242 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug3K6K9K12K15KSE +/- 40.01, N = 3SE +/- 81.04, N = 3SE +/- 76.28, N = 3SE +/- 124.86, N = 3SE +/- 644.68, N = 1211906.012316.012221.011903.010890.7

FFTW

Build: Float + SSE - Size: 1D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 642 x Intel Xeon E5-2620 v2NoGVNO2NoGVNO3OptNoSimplO2debug2K4K6K8K10KSE +/- 54.50, N = 3SE +/- 99.10, N = 3SE +/- 114.62, N = 3SE +/- 33.87, N = 3SE +/- 172.96, N = 1510229.010350.010470.010131.09870.6


Phoronix Test Suite v10.8.4