fftw rocket lake Intel Core i9-11900K testing with a ASUS ROG MAXIMUS XIII HERO (0703 BIOS) and AMD Radeon RX 6800/6800 XT / 6900 16GB on Ubuntu 21.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2104046-IB-FFTWROCKE77&rdt&grw .
fftw rocket lake Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Compiler File-System Screen Resolution 1 2 3 Intel Core i9-11900K @ 5.10GHz (8 Cores / 16 Threads) ASUS ROG MAXIMUS XIII HERO (0703 BIOS) Intel Tiger Lake-H 32GB 1000GB Western Digital WD_BLACK SN850 1TB AMD Radeon RX 6800/6800 XT / 6900 16GB (2575/1000MHz) Intel Tiger Lake-H HD Audio ASUS MG28U 2 x Intel I225-V + Intel Device 2725 Ubuntu 21.04 5.12.0-051200rc3daily20210315-generic (x86_64) 20210314 GNOME Shell 3.38.3 X Server 1.20.10 + Wayland 4.6 Mesa 21.1.0-devel (git-616720d 2021-03-16 hirsute-oibaf-ppa) (LLVM 12.0.0) GCC 10.2.1 20210320 ext4 3840x2160 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Environment Details - DEBUGINFOD_URLS= Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-DjbZbO/gcc-10-10.2.1/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-DjbZbO/gcc-10-10.2.1/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0x3c - Thermald 2.4.3 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
fftw rocket lake fftw: Stock - 1D FFT Size 32 fftw: Stock - 1D FFT Size 64 fftw: Stock - 2D FFT Size 32 fftw: Stock - 2D FFT Size 64 fftw: Stock - 1D FFT Size 128 fftw: Stock - 1D FFT Size 256 fftw: Stock - 1D FFT Size 512 fftw: Stock - 2D FFT Size 128 fftw: Stock - 2D FFT Size 256 fftw: Stock - 2D FFT Size 512 fftw: Stock - 1D FFT Size 1024 fftw: Stock - 1D FFT Size 2048 fftw: Stock - 1D FFT Size 4096 fftw: Stock - 2D FFT Size 1024 fftw: Stock - 2D FFT Size 2048 fftw: Stock - 2D FFT Size 4096 fftw: Float + SSE - 1D FFT Size 32 fftw: Float + SSE - 1D FFT Size 64 fftw: Float + SSE - 2D FFT Size 32 fftw: Float + SSE - 2D FFT Size 64 fftw: Float + SSE - 1D FFT Size 128 fftw: Float + SSE - 1D FFT Size 256 fftw: Float + SSE - 1D FFT Size 512 fftw: Float + SSE - 2D FFT Size 128 fftw: Float + SSE - 2D FFT Size 256 fftw: Float + SSE - 2D FFT Size 512 fftw: Float + SSE - 1D FFT Size 1024 fftw: Float + SSE - 1D FFT Size 2048 fftw: Float + SSE - 1D FFT Size 4096 fftw: Float + SSE - 2D FFT Size 1024 fftw: Float + SSE - 2D FFT Size 2048 fftw: Float + SSE - 2D FFT Size 4096 1 2 3 11822.9 12109 13667 11418 10487 11047 11014 11070 10249 10036.8 11269 10542 10451 9412.4 8175.4 7528.4 19056 31301 61915 53842 38631 55392 76670 46326 39016 43530 92791 90101 85418 42377 31197 27931 11669 12083 13709 11525 10414.1 11103 11061 10921 10263 10049 10981 10568 10441 9381.2 7963.6 7504.5 19631 32206 63844 54924 39353 54070 75084 46515 39158 44137 93009 92597 86204 41954 30854 27265 11765 12300 13814 11375 10528.9 11092 11026 10984 10214 10095 11257 10580 10372 9472.9 8060.2 7453.9 19484 31466 65903 55547 38590 54242 75469 46640 39235 43846 92756 88798 85034 41964 30997 27817 OpenBenchmarking.org
FFTW Build: Stock - Size: 1D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 32 1 2 3 3K 6K 9K 12K 15K SE +/- 253.18, N = 15 SE +/- 263.60, N = 15 SE +/- 225.97, N = 15 11822.9 11669.0 11765.0 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 1D FFT Size 64 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 64 1 2 3 3K 6K 9K 12K 15K SE +/- 124.67, N = 15 SE +/- 106.96, N = 15 SE +/- 21.40, N = 3 12109 12083 12300 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 2D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 32 1 2 3 3K 6K 9K 12K 15K SE +/- 59.30, N = 3 SE +/- 30.81, N = 3 SE +/- 8.35, N = 3 13667 13709 13814 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 2D FFT Size 64 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 64 1 2 3 2K 4K 6K 8K 10K SE +/- 155.24, N = 3 SE +/- 96.48, N = 8 SE +/- 109.06, N = 3 11418 11525 11375 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 1D FFT Size 128 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 128 1 2 3 2K 4K 6K 8K 10K SE +/- 25.86, N = 3 SE +/- 114.56, N = 15 SE +/- 126.97, N = 15 10487.0 10414.1 10528.9 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 1D FFT Size 256 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 256 1 2 3 2K 4K 6K 8K 10K SE +/- 42.90, N = 3 SE +/- 38.73, N = 3 SE +/- 40.76, N = 3 11047 11103 11092 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 1D FFT Size 512 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 512 1 2 3 2K 4K 6K 8K 10K SE +/- 42.51, N = 3 SE +/- 81.88, N = 3 SE +/- 105.08, N = 3 11014 11061 11026 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 2D FFT Size 128 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 128 1 2 3 2K 4K 6K 8K 10K SE +/- 48.64, N = 3 SE +/- 114.87, N = 3 SE +/- 75.94, N = 3 11070 10921 10984 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 2D FFT Size 256 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 256 1 2 3 2K 4K 6K 8K 10K SE +/- 39.79, N = 3 SE +/- 10.41, N = 3 SE +/- 25.24, N = 3 10249 10263 10214 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 2D FFT Size 512 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 512 1 2 3 2K 4K 6K 8K 10K SE +/- 29.47, N = 3 SE +/- 39.41, N = 3 SE +/- 21.73, N = 3 10036.8 10049.0 10095.0 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 1D FFT Size 1024 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 1024 1 2 3 2K 4K 6K 8K 10K SE +/- 25.24, N = 3 SE +/- 105.23, N = 3 SE +/- 47.13, N = 3 11269 10981 11257 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 1D FFT Size 2048 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 2048 1 2 3 2K 4K 6K 8K 10K SE +/- 40.07, N = 3 SE +/- 77.57, N = 3 SE +/- 17.03, N = 3 10542 10568 10580 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 1D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 4096 1 2 3 2K 4K 6K 8K 10K SE +/- 68.49, N = 3 SE +/- 37.68, N = 3 SE +/- 26.39, N = 3 10451 10441 10372 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 2D FFT Size 1024 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 1024 1 2 3 2K 4K 6K 8K 10K SE +/- 31.88, N = 3 SE +/- 94.34, N = 3 SE +/- 84.69, N = 3 9412.4 9381.2 9472.9 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 2D FFT Size 2048 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 2048 1 2 3 2K 4K 6K 8K 10K SE +/- 65.64, N = 3 SE +/- 89.29, N = 3 SE +/- 40.32, N = 3 8175.4 7963.6 8060.2 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 1 2 3 1600 3200 4800 6400 8000 SE +/- 6.74, N = 3 SE +/- 20.15, N = 3 SE +/- 24.74, N = 3 7528.4 7504.5 7453.9 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 1D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 32 1 2 3 4K 8K 12K 16K 20K SE +/- 315.90, N = 12 SE +/- 63.46, N = 3 SE +/- 136.51, N = 15 19056 19631 19484 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 1D FFT Size 64 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 64 1 2 3 7K 14K 21K 28K 35K SE +/- 316.86, N = 15 SE +/- 393.56, N = 3 SE +/- 371.82, N = 15 31301 32206 31466 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 32 1 2 3 14K 28K 42K 56K 70K SE +/- 1809.21, N = 15 SE +/- 1087.48, N = 12 SE +/- 323.75, N = 3 61915 63844 65903 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 64 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 64 1 2 3 12K 24K 36K 48K 60K SE +/- 1307.69, N = 12 SE +/- 1037.57, N = 15 SE +/- 1112.00, N = 12 53842 54924 55547 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 1D FFT Size 128 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 128 1 2 3 8K 16K 24K 32K 40K SE +/- 550.52, N = 15 SE +/- 332.75, N = 15 SE +/- 540.19, N = 3 38631 39353 38590 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 1D FFT Size 256 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 256 1 2 3 12K 24K 36K 48K 60K SE +/- 531.99, N = 15 SE +/- 258.94, N = 3 SE +/- 420.25, N = 3 55392 54070 54242 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 1D FFT Size 512 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 512 1 2 3 16K 32K 48K 64K 80K SE +/- 543.81, N = 15 SE +/- 544.39, N = 3 SE +/- 892.90, N = 3 76670 75084 75469 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 128 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 128 1 2 3 10K 20K 30K 40K 50K SE +/- 414.98, N = 3 SE +/- 318.43, N = 3 SE +/- 46.53, N = 3 46326 46515 46640 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 256 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 256 1 2 3 8K 16K 24K 32K 40K SE +/- 197.71, N = 3 SE +/- 202.84, N = 3 SE +/- 507.24, N = 3 39016 39158 39235 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 512 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 512 1 2 3 9K 18K 27K 36K 45K SE +/- 229.96, N = 3 SE +/- 107.64, N = 3 SE +/- 53.54, N = 3 43530 44137 43846 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 1D FFT Size 1024 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 1024 1 2 3 20K 40K 60K 80K 100K SE +/- 499.12, N = 3 SE +/- 924.94, N = 3 SE +/- 933.06, N = 3 92791 93009 92756 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 1D FFT Size 2048 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 2048 1 2 3 20K 40K 60K 80K 100K SE +/- 745.03, N = 3 SE +/- 578.38, N = 3 SE +/- 729.50, N = 3 90101 92597 88798 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 1D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 4096 1 2 3 20K 40K 60K 80K 100K SE +/- 414.09, N = 3 SE +/- 42.51, N = 3 SE +/- 537.34, N = 3 85418 86204 85034 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 1024 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 1024 1 2 3 9K 18K 27K 36K 45K SE +/- 186.76, N = 3 SE +/- 68.67, N = 3 SE +/- 109.36, N = 3 42377 41954 41964 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 2048 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 2048 1 2 3 7K 14K 21K 28K 35K SE +/- 116.56, N = 3 SE +/- 116.92, N = 3 SE +/- 151.77, N = 3 31197 30854 30997 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 1 2 3 6K 12K 18K 24K 30K SE +/- 52.31, N = 3 SE +/- 164.26, N = 3 SE +/- 363.01, N = 3 27931 27265 27817 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
Phoronix Test Suite v10.8.5