fftw + oneDNN Intel Core i9-10900K testing with a Gigabyte Z490 AORUS MASTER (F3 BIOS) and AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB on Ubuntu 20.04 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2006134-NE-FFTWONEDN92 Intel Core i9-10900K Processor: Intel Core i9-10900K @ 5.30GHz (10 Cores / 20 Threads), Motherboard: Gigabyte Z490 AORUS MASTER (F3 BIOS), Chipset: Intel Comet Lake PCH, Memory: 16GB, Disk: Samsung SSD 970 EVO 250GB, Graphics: AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz), Audio: Realtek ALC1220, Monitor: DELL P2415Q, Network: Intel Device 15f3 + Intel Wi-Fi 6 AX201
OS: Ubuntu 20.04, Kernel: 5.4.0-37-generic (x86_64), Desktop: GNOME Shell 3.36.2, Display Server: X Server 1.20.8, Display Driver: amdgpu 19.1.0, OpenGL: 4.6 Mesa 20.0.4 (LLVM 9.0.1), Compiler: GCC 9.3.0, File-System: ext4, Screen Resolution: 3840x2160
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xc8Security Notes: itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
fftw + oneDNN OpenBenchmarking.org Phoronix Test Suite Intel Core i9-10900K @ 5.30GHz (10 Cores / 20 Threads) Gigabyte Z490 AORUS MASTER (F3 BIOS) Intel Comet Lake PCH 16GB Samsung SSD 970 EVO 250GB AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz) Realtek ALC1220 DELL P2415Q Intel Device 15f3 + Intel Wi-Fi 6 AX201 Ubuntu 20.04 5.4.0-37-generic (x86_64) GNOME Shell 3.36.2 X Server 1.20.8 amdgpu 19.1.0 4.6 Mesa 20.0.4 (LLVM 9.0.1) GCC 9.3.0 ext4 3840x2160 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution Fftw + OneDNN Benchmarks System Logs - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xc8 - itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
fftw + oneDNN fftw: Stock - 1D FFT Size 32 fftw: Stock - 1D FFT Size 64 fftw: Stock - 2D FFT Size 32 fftw: Stock - 2D FFT Size 64 fftw: Stock - 1D FFT Size 128 fftw: Stock - 1D FFT Size 256 fftw: Stock - 1D FFT Size 512 fftw: Stock - 2D FFT Size 128 fftw: Stock - 2D FFT Size 256 fftw: Stock - 2D FFT Size 512 fftw: Stock - 1D FFT Size 1024 fftw: Stock - 1D FFT Size 2048 fftw: Stock - 1D FFT Size 4096 fftw: Stock - 2D FFT Size 1024 fftw: Stock - 2D FFT Size 2048 fftw: Stock - 2D FFT Size 4096 fftw: Float + SSE - 1D FFT Size 32 fftw: Float + SSE - 1D FFT Size 64 fftw: Float + SSE - 2D FFT Size 32 fftw: Float + SSE - 2D FFT Size 64 fftw: Float + SSE - 1D FFT Size 128 fftw: Float + SSE - 1D FFT Size 256 fftw: Float + SSE - 1D FFT Size 512 fftw: Float + SSE - 2D FFT Size 128 fftw: Float + SSE - 2D FFT Size 256 fftw: Float + SSE - 2D FFT Size 512 fftw: Float + SSE - 1D FFT Size 1024 fftw: Float + SSE - 1D FFT Size 2048 fftw: Float + SSE - 1D FFT Size 4096 fftw: Float + SSE - 2D FFT Size 1024 fftw: Float + SSE - 2D FFT Size 2048 fftw: Float + SSE - 2D FFT Size 4096 mkl-dnn: IP Batch 1D - f32 mkl-dnn: IP Batch All - f32 mkl-dnn: IP Batch 1D - u8s8f32 mkl-dnn: IP Batch All - u8s8f32 mkl-dnn: Deconvolution Batch deconv_1d - f32 mkl-dnn: Deconvolution Batch deconv_3d - f32 mkl-dnn: Deconvolution Batch deconv_1d - u8s8f32 mkl-dnn: Deconvolution Batch deconv_3d - u8s8f32 mkl-dnn: Recurrent Neural Network Training - f32 mkl-dnn: Recurrent Neural Network Inference - f32 Intel Core i9-10900K 10261.0 10126 11584 10229 9828.6 10157 10404 9643.4 9241.1 9424.9 10459 9976.8 9654.5 9156.6 7796.2 7265.5 20898 26045 60554 54752 31136 43459 51405 40284 38113 40631 56835 56885 52286 40919 31665 27971 3.25265 57.1582 1.17351 19.0826 3.02711 4.77248 123.946 2.20842 170.449 27.9311 OpenBenchmarking.org
FFTW FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 32 Intel Core i9-10900K 2K 4K 6K 8K 10K SE +/- 168.28, N = 3 10261.0 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 64 Intel Core i9-10900K 2K 4K 6K 8K 10K SE +/- 8.01, N = 3 10126 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 32 Intel Core i9-10900K 2K 4K 6K 8K 10K SE +/- 7.13, N = 3 11584 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 64 Intel Core i9-10900K 2K 4K 6K 8K 10K SE +/- 54.85, N = 3 10229 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 128 Intel Core i9-10900K 2K 4K 6K 8K 10K SE +/- 111.03, N = 7 9828.6 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 256 Intel Core i9-10900K 2K 4K 6K 8K 10K SE +/- 2.67, N = 3 10157 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 512 Intel Core i9-10900K 2K 4K 6K 8K 10K SE +/- 7.94, N = 3 10404 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 128 Intel Core i9-10900K 2K 4K 6K 8K 10K SE +/- 36.79, N = 3 9643.4 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 256 Intel Core i9-10900K 2K 4K 6K 8K 10K SE +/- 38.09, N = 3 9241.1 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 512 Intel Core i9-10900K 2K 4K 6K 8K 10K SE +/- 36.95, N = 3 9424.9 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 1024 Intel Core i9-10900K 2K 4K 6K 8K 10K SE +/- 28.52, N = 3 10459 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 2048 Intel Core i9-10900K 2K 4K 6K 8K 10K SE +/- 15.39, N = 3 9976.8 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 4096 Intel Core i9-10900K 2K 4K 6K 8K 10K SE +/- 80.99, N = 3 9654.5 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 1024 Intel Core i9-10900K 2K 4K 6K 8K 10K SE +/- 19.40, N = 3 9156.6 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 2048 Intel Core i9-10900K 2K 4K 6K 8K 10K SE +/- 20.04, N = 3 7796.2 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 Intel Core i9-10900K 1600 3200 4800 6400 8000 SE +/- 16.94, N = 3 7265.5 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 32 Intel Core i9-10900K 4K 8K 12K 16K 20K SE +/- 53.54, N = 3 20898 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 64 Intel Core i9-10900K 6K 12K 18K 24K 30K SE +/- 385.60, N = 3 26045 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 32 Intel Core i9-10900K 13K 26K 39K 52K 65K SE +/- 351.90, N = 3 60554 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 64 Intel Core i9-10900K 12K 24K 36K 48K 60K SE +/- 753.98, N = 3 54752 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 128 Intel Core i9-10900K 7K 14K 21K 28K 35K SE +/- 273.58, N = 3 31136 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 256 Intel Core i9-10900K 9K 18K 27K 36K 45K SE +/- 713.09, N = 3 43459 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 512 Intel Core i9-10900K 11K 22K 33K 44K 55K SE +/- 573.69, N = 3 51405 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 128 Intel Core i9-10900K 9K 18K 27K 36K 45K SE +/- 459.89, N = 3 40284 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 256 Intel Core i9-10900K 8K 16K 24K 32K 40K SE +/- 578.44, N = 3 38113 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 512 Intel Core i9-10900K 9K 18K 27K 36K 45K SE +/- 96.36, N = 3 40631 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 1024 Intel Core i9-10900K 12K 24K 36K 48K 60K SE +/- 176.97, N = 3 56835 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 2048 Intel Core i9-10900K 12K 24K 36K 48K 60K SE +/- 271.81, N = 3 56885 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 4096 Intel Core i9-10900K 11K 22K 33K 44K 55K SE +/- 408.92, N = 3 52286 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 1024 Intel Core i9-10900K 9K 18K 27K 36K 45K SE +/- 70.82, N = 3 40919 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 2048 Intel Core i9-10900K 7K 14K 21K 28K 35K SE +/- 135.51, N = 3 31665 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 Intel Core i9-10900K 6K 12K 18K 24K 30K SE +/- 305.15, N = 3 27971 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
oneDNN MKL-DNN This is a test of the Intel oneDNN (formerly DNNL / Deep Neural Network Library / MKL-DNN) as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better oneDNN MKL-DNN 1.3 Harness: IP Batch 1D - Data Type: f32 Intel Core i9-10900K 0.7318 1.4636 2.1954 2.9272 3.659 SE +/- 0.02168, N = 3 3.25265 MIN: 3.01 1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN MKL-DNN 1.3 Harness: IP Batch All - Data Type: f32 Intel Core i9-10900K 13 26 39 52 65 SE +/- 0.10, N = 3 57.16 MIN: 56.62 1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN MKL-DNN 1.3 Harness: IP Batch 1D - Data Type: u8s8f32 Intel Core i9-10900K 0.264 0.528 0.792 1.056 1.32 SE +/- 0.00077, N = 3 1.17351 MIN: 1.16 1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN MKL-DNN 1.3 Harness: IP Batch All - Data Type: u8s8f32 Intel Core i9-10900K 5 10 15 20 25 SE +/- 0.01, N = 3 19.08 MIN: 18.84 1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN MKL-DNN 1.3 Harness: Deconvolution Batch deconv_1d - Data Type: f32 Intel Core i9-10900K 0.6811 1.3622 2.0433 2.7244 3.4055 SE +/- 0.00360, N = 3 3.02711 MIN: 2.99 1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN MKL-DNN 1.3 Harness: Deconvolution Batch deconv_3d - Data Type: f32 Intel Core i9-10900K 1.0738 2.1476 3.2214 4.2952 5.369 SE +/- 0.00783, N = 3 4.77248 MIN: 4.74 1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN MKL-DNN 1.3 Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 Intel Core i9-10900K 30 60 90 120 150 SE +/- 2.86, N = 15 123.95 MIN: 109.96 1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN MKL-DNN 1.3 Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 Intel Core i9-10900K 0.4969 0.9938 1.4907 1.9876 2.4845 SE +/- 0.00516, N = 3 2.20842 MIN: 2.18 1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN MKL-DNN 1.3 Harness: Recurrent Neural Network Training - Data Type: f32 Intel Core i9-10900K 40 80 120 160 200 SE +/- 1.03, N = 3 170.45 MIN: 165.57 1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN MKL-DNN 1.3 Harness: Recurrent Neural Network Inference - Data Type: f32 Intel Core i9-10900K 7 14 21 28 35 SE +/- 0.03, N = 3 27.93 MIN: 27.53 1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl
Intel Core i9-10900K Processor: Intel Core i9-10900K @ 5.30GHz (10 Cores / 20 Threads), Motherboard: Gigabyte Z490 AORUS MASTER (F3 BIOS), Chipset: Intel Comet Lake PCH, Memory: 16GB, Disk: Samsung SSD 970 EVO 250GB, Graphics: AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz), Audio: Realtek ALC1220, Monitor: DELL P2415Q, Network: Intel Device 15f3 + Intel Wi-Fi 6 AX201
OS: Ubuntu 20.04, Kernel: 5.4.0-37-generic (x86_64), Desktop: GNOME Shell 3.36.2, Display Server: X Server 1.20.8, Display Driver: amdgpu 19.1.0, OpenGL: 4.6 Mesa 20.0.4 (LLVM 9.0.1), Compiler: GCC 9.3.0, File-System: ext4, Screen Resolution: 3840x2160
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xc8Security Notes: itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 13 June 2020 06:09 by user phoronix.