fftw rocket lake Intel Core i9-11900K testing with a ASUS ROG MAXIMUS XIII HERO (0703 BIOS) and AMD Radeon RX 6800/6800 XT / 6900 16GB on Ubuntu 21.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2104046-IB-FFTWROCKE77&grr&sro .
fftw rocket lake Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Compiler File-System Screen Resolution 1 2 3 Intel Core i9-11900K @ 5.10GHz (8 Cores / 16 Threads) ASUS ROG MAXIMUS XIII HERO (0703 BIOS) Intel Tiger Lake-H 32GB 1000GB Western Digital WD_BLACK SN850 1TB AMD Radeon RX 6800/6800 XT / 6900 16GB (2575/1000MHz) Intel Tiger Lake-H HD Audio ASUS MG28U 2 x Intel I225-V + Intel Device 2725 Ubuntu 21.04 5.12.0-051200rc3daily20210315-generic (x86_64) 20210314 GNOME Shell 3.38.3 X Server 1.20.10 + Wayland 4.6 Mesa 21.1.0-devel (git-616720d 2021-03-16 hirsute-oibaf-ppa) (LLVM 12.0.0) GCC 10.2.1 20210320 ext4 3840x2160 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Environment Details - DEBUGINFOD_URLS= Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-DjbZbO/gcc-10-10.2.1/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-DjbZbO/gcc-10-10.2.1/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0x3c - Thermald 2.4.3 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
fftw rocket lake fftw: Float + SSE - 2D FFT Size 4096 fftw: Stock - 2D FFT Size 4096 fftw: Float + SSE - 2D FFT Size 2048 fftw: Stock - 2D FFT Size 2048 fftw: Stock - 1D FFT Size 32 fftw: Float + SSE - 2D FFT Size 64 fftw: Float + SSE - 2D FFT Size 1024 fftw: Float + SSE - 1D FFT Size 64 fftw: Float + SSE - 1D FFT Size 128 fftw: Stock - 1D FFT Size 128 fftw: Float + SSE - 1D FFT Size 512 fftw: Float + SSE - 2D FFT Size 32 fftw: Stock - 2D FFT Size 1024 fftw: Stock - 1D FFT Size 64 fftw: Float + SSE - 1D FFT Size 32 fftw: Float + SSE - 1D FFT Size 256 fftw: Float + SSE - 2D FFT Size 512 fftw: Float + SSE - 1D FFT Size 4096 fftw: Float + SSE - 1D FFT Size 1024 fftw: Stock - 2D FFT Size 512 fftw: Float + SSE - 2D FFT Size 256 fftw: Float + SSE - 1D FFT Size 2048 fftw: Stock - 1D FFT Size 1024 fftw: Stock - 1D FFT Size 4096 fftw: Stock - 2D FFT Size 256 fftw: Stock - 1D FFT Size 512 fftw: Stock - 2D FFT Size 64 fftw: Stock - 1D FFT Size 2048 fftw: Stock - 1D FFT Size 256 fftw: Stock - 2D FFT Size 32 fftw: Float + SSE - 2D FFT Size 128 fftw: Stock - 2D FFT Size 128 1 2 3 27931 7528.4 31197 8175.4 11822.9 53842 42377 31301 38631 10487 76670 61915 9412.4 12109 19056 55392 43530 85418 92791 10036.8 39016 90101 11269 10451 10249 11014 11418 10542 11047 13667 46326 11070 27265 7504.5 30854 7963.6 11669 54924 41954 32206 39353 10414.1 75084 63844 9381.2 12083 19631 54070 44137 86204 93009 10049 39158 92597 10981 10441 10263 11061 11525 10568 11103 13709 46515 10921 27817 7453.9 30997 8060.2 11765 55547 41964 31466 38590 10528.9 75469 65903 9472.9 12300 19484 54242 43846 85034 92756 10095 39235 88798 11257 10372 10214 11026 11375 10580 11092 13814 46640 10984 OpenBenchmarking.org
FFTW Build: Float + SSE - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 1 2 3 6K 12K 18K 24K 30K SE +/- 52.31, N = 3 SE +/- 164.26, N = 3 SE +/- 363.01, N = 3 27931 27265 27817 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 1 2 3 1600 3200 4800 6400 8000 SE +/- 6.74, N = 3 SE +/- 20.15, N = 3 SE +/- 24.74, N = 3 7528.4 7504.5 7453.9 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 2048 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 2048 1 2 3 7K 14K 21K 28K 35K SE +/- 116.56, N = 3 SE +/- 116.92, N = 3 SE +/- 151.77, N = 3 31197 30854 30997 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 2D FFT Size 2048 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 2048 1 2 3 2K 4K 6K 8K 10K SE +/- 65.64, N = 3 SE +/- 89.29, N = 3 SE +/- 40.32, N = 3 8175.4 7963.6 8060.2 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 1D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 32 1 2 3 3K 6K 9K 12K 15K SE +/- 253.18, N = 15 SE +/- 263.60, N = 15 SE +/- 225.97, N = 15 11822.9 11669.0 11765.0 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 64 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 64 1 2 3 12K 24K 36K 48K 60K SE +/- 1307.69, N = 12 SE +/- 1037.57, N = 15 SE +/- 1112.00, N = 12 53842 54924 55547 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 1024 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 1024 1 2 3 9K 18K 27K 36K 45K SE +/- 186.76, N = 3 SE +/- 68.67, N = 3 SE +/- 109.36, N = 3 42377 41954 41964 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 1D FFT Size 64 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 64 1 2 3 7K 14K 21K 28K 35K SE +/- 316.86, N = 15 SE +/- 393.56, N = 3 SE +/- 371.82, N = 15 31301 32206 31466 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 1D FFT Size 128 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 128 1 2 3 8K 16K 24K 32K 40K SE +/- 550.52, N = 15 SE +/- 332.75, N = 15 SE +/- 540.19, N = 3 38631 39353 38590 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 1D FFT Size 128 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 128 1 2 3 2K 4K 6K 8K 10K SE +/- 25.86, N = 3 SE +/- 114.56, N = 15 SE +/- 126.97, N = 15 10487.0 10414.1 10528.9 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 1D FFT Size 512 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 512 1 2 3 16K 32K 48K 64K 80K SE +/- 543.81, N = 15 SE +/- 544.39, N = 3 SE +/- 892.90, N = 3 76670 75084 75469 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 32 1 2 3 14K 28K 42K 56K 70K SE +/- 1809.21, N = 15 SE +/- 1087.48, N = 12 SE +/- 323.75, N = 3 61915 63844 65903 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 2D FFT Size 1024 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 1024 1 2 3 2K 4K 6K 8K 10K SE +/- 31.88, N = 3 SE +/- 94.34, N = 3 SE +/- 84.69, N = 3 9412.4 9381.2 9472.9 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 1D FFT Size 64 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 64 1 2 3 3K 6K 9K 12K 15K SE +/- 124.67, N = 15 SE +/- 106.96, N = 15 SE +/- 21.40, N = 3 12109 12083 12300 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 1D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 32 1 2 3 4K 8K 12K 16K 20K SE +/- 315.90, N = 12 SE +/- 63.46, N = 3 SE +/- 136.51, N = 15 19056 19631 19484 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 1D FFT Size 256 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 256 1 2 3 12K 24K 36K 48K 60K SE +/- 531.99, N = 15 SE +/- 258.94, N = 3 SE +/- 420.25, N = 3 55392 54070 54242 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 512 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 512 1 2 3 9K 18K 27K 36K 45K SE +/- 229.96, N = 3 SE +/- 107.64, N = 3 SE +/- 53.54, N = 3 43530 44137 43846 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 1D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 4096 1 2 3 20K 40K 60K 80K 100K SE +/- 414.09, N = 3 SE +/- 42.51, N = 3 SE +/- 537.34, N = 3 85418 86204 85034 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 1D FFT Size 1024 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 1024 1 2 3 20K 40K 60K 80K 100K SE +/- 499.12, N = 3 SE +/- 924.94, N = 3 SE +/- 933.06, N = 3 92791 93009 92756 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 2D FFT Size 512 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 512 1 2 3 2K 4K 6K 8K 10K SE +/- 29.47, N = 3 SE +/- 39.41, N = 3 SE +/- 21.73, N = 3 10036.8 10049.0 10095.0 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 256 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 256 1 2 3 8K 16K 24K 32K 40K SE +/- 197.71, N = 3 SE +/- 202.84, N = 3 SE +/- 507.24, N = 3 39016 39158 39235 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 1D FFT Size 2048 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 2048 1 2 3 20K 40K 60K 80K 100K SE +/- 745.03, N = 3 SE +/- 578.38, N = 3 SE +/- 729.50, N = 3 90101 92597 88798 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 1D FFT Size 1024 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 1024 1 2 3 2K 4K 6K 8K 10K SE +/- 25.24, N = 3 SE +/- 105.23, N = 3 SE +/- 47.13, N = 3 11269 10981 11257 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 1D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 4096 1 2 3 2K 4K 6K 8K 10K SE +/- 68.49, N = 3 SE +/- 37.68, N = 3 SE +/- 26.39, N = 3 10451 10441 10372 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 2D FFT Size 256 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 256 1 2 3 2K 4K 6K 8K 10K SE +/- 39.79, N = 3 SE +/- 10.41, N = 3 SE +/- 25.24, N = 3 10249 10263 10214 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 1D FFT Size 512 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 512 1 2 3 2K 4K 6K 8K 10K SE +/- 42.51, N = 3 SE +/- 81.88, N = 3 SE +/- 105.08, N = 3 11014 11061 11026 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 2D FFT Size 64 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 64 1 2 3 2K 4K 6K 8K 10K SE +/- 155.24, N = 3 SE +/- 96.48, N = 8 SE +/- 109.06, N = 3 11418 11525 11375 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 1D FFT Size 2048 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 2048 1 2 3 2K 4K 6K 8K 10K SE +/- 40.07, N = 3 SE +/- 77.57, N = 3 SE +/- 17.03, N = 3 10542 10568 10580 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 1D FFT Size 256 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 256 1 2 3 2K 4K 6K 8K 10K SE +/- 42.90, N = 3 SE +/- 38.73, N = 3 SE +/- 40.76, N = 3 11047 11103 11092 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 2D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 32 1 2 3 3K 6K 9K 12K 15K SE +/- 59.30, N = 3 SE +/- 30.81, N = 3 SE +/- 8.35, N = 3 13667 13709 13814 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 128 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 128 1 2 3 10K 20K 30K 40K 50K SE +/- 414.98, N = 3 SE +/- 318.43, N = 3 SE +/- 46.53, N = 3 46326 46515 46640 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 2D FFT Size 128 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 128 1 2 3 2K 4K 6K 8K 10K SE +/- 48.64, N = 3 SE +/- 114.87, N = 3 SE +/- 75.94, N = 3 11070 10921 10984 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
Phoronix Test Suite v10.8.5