mkl-dnn + fftw

Intel Core i5-10600K testing with a ASUS PRIME Z490M-PLUS (0603 BIOS) and Sapphire AMD Radeon RX 470/480/570/570X/580/580X/590 8GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2006163-NE-MKLDNNFFT18&gru.

mkl-dnn + fftwProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionIntel Core i3-10100Intel Core i5-10600KIntel Core i3-10100 @ 4.30GHz (4 Cores / 8 Threads)ASUS PRIME Z490M-PLUS (0603 BIOS)Intel Comet Lake PCH16GB240GB Force MP510 + 2000GB Samsung SSD 860Sapphire AMD Radeon RX 470/480/570/570X/580/580X/590 8GB (1560/2100MHz)Realtek ALC887-VDASUS MG28UIntelUbuntu 20.045.7.0-rc6-amd-energy (x86_64) 20200527GNOME Shell 3.36.2X Server 1.20.8modesetting 1.20.84.6 Mesa 20.0.4 (LLVM 9.0.1)GCC 9.3.0ext43840x2160Intel Core i5-10600K @ 4.80GHz (6 Cores / 12 Threads)OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Intel Core i3-10100: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xcc- Intel Core i5-10600K: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xc8Security Details- itlb_multihit: KVM: Vulnerable + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + tsx_async_abort: Not affected

mkl-dnn + fftwfftw: Stock - 1D FFT Size 32fftw: Stock - 1D FFT Size 64fftw: Stock - 2D FFT Size 32fftw: Stock - 2D FFT Size 64fftw: Stock - 1D FFT Size 128fftw: Stock - 1D FFT Size 256fftw: Stock - 1D FFT Size 512fftw: Stock - 2D FFT Size 128fftw: Stock - 2D FFT Size 256fftw: Stock - 2D FFT Size 512fftw: Stock - 1D FFT Size 1024fftw: Stock - 1D FFT Size 2048fftw: Stock - 1D FFT Size 4096fftw: Stock - 2D FFT Size 1024fftw: Stock - 2D FFT Size 2048fftw: Stock - 2D FFT Size 4096fftw: Float + SSE - 1D FFT Size 32fftw: Float + SSE - 1D FFT Size 64fftw: Float + SSE - 2D FFT Size 32fftw: Float + SSE - 2D FFT Size 64fftw: Float + SSE - 1D FFT Size 128fftw: Float + SSE - 1D FFT Size 256fftw: Float + SSE - 1D FFT Size 512fftw: Float + SSE - 2D FFT Size 128fftw: Float + SSE - 2D FFT Size 256fftw: Float + SSE - 2D FFT Size 512fftw: Float + SSE - 1D FFT Size 1024fftw: Float + SSE - 1D FFT Size 2048fftw: Float + SSE - 1D FFT Size 4096fftw: Float + SSE - 2D FFT Size 1024fftw: Float + SSE - 2D FFT Size 2048fftw: Float + SSE - 2D FFT Size 4096mkl-dnn: IP Batch 1D - f32mkl-dnn: IP Batch All - f32mkl-dnn: IP Batch 1D - u8s8f32mkl-dnn: IP Batch All - u8s8f32mkl-dnn: Deconvolution Batch deconv_1d - f32mkl-dnn: Deconvolution Batch deconv_3d - f32mkl-dnn: Deconvolution Batch deconv_1d - u8s8f32mkl-dnn: Deconvolution Batch deconv_3d - u8s8f32mkl-dnn: Recurrent Neural Network Training - f32mkl-dnn: Recurrent Neural Network Inference - f32Intel Core i3-10100Intel Core i5-10600K8698.88482.19711.38257.57968.38153.38307.47932.27520.57495.88449.98070.17895.16657.96258.65964.1169752111249207438852520535177441053395032998346774751047673445982990926279243547.32233101.9233.1362742.93578.4200812.5169318.8596.47082428.081113.1429869.19457.0108599279.09205.09197.29436.78708.88467.08434.59478.68965.58752.57477.36548.76334.2184422330055199492212790238866490093842036522387095331453506494933769826053227674.5491374.62472.0233930.06365.464028.48212222.3884.41247285.86252.0325OpenBenchmarking.org

FFTW

Build: Stock - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 32Intel Core i3-10100Intel Core i5-10600K2K4K6K8K10KSE +/- 125.21, N = 4SE +/- 30.95, N = 38698.89869.11. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 64Intel Core i3-10100Intel Core i5-10600K2K4K6K8K10KSE +/- 25.76, N = 3SE +/- 73.01, N = 38482.19457.01. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 32Intel Core i3-10100Intel Core i5-10600K2K4K6K8K10KSE +/- 26.94, N = 3SE +/- 35.35, N = 39711.310859.01. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 64Intel Core i3-10100Intel Core i5-10600K2K4K6K8K10KSE +/- 23.23, N = 3SE +/- 34.34, N = 38257.59279.01. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 128Intel Core i3-10100Intel Core i5-10600K2K4K6K8K10KSE +/- 12.44, N = 3SE +/- 20.43, N = 37968.39205.01. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 256Intel Core i3-10100Intel Core i5-10600K2K4K6K8K10KSE +/- 34.58, N = 3SE +/- 7.65, N = 38153.39197.21. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 512Intel Core i3-10100Intel Core i5-10600K2K4K6K8K10KSE +/- 84.60, N = 3SE +/- 10.79, N = 38307.49436.71. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 128Intel Core i3-10100Intel Core i5-10600K2K4K6K8K10KSE +/- 7.05, N = 3SE +/- 61.31, N = 37932.28708.81. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 256Intel Core i3-10100Intel Core i5-10600K2K4K6K8K10KSE +/- 59.45, N = 3SE +/- 15.71, N = 37520.58467.01. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 512Intel Core i3-10100Intel Core i5-10600K2K4K6K8K10KSE +/- 8.90, N = 3SE +/- 77.07, N = 37495.88434.51. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 1024Intel Core i3-10100Intel Core i5-10600K2K4K6K8K10KSE +/- 8.11, N = 3SE +/- 13.72, N = 38449.99478.61. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 2048Intel Core i3-10100Intel Core i5-10600K2K4K6K8K10KSE +/- 26.75, N = 3SE +/- 36.66, N = 38070.18965.51. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 4096Intel Core i3-10100Intel Core i5-10600K2K4K6K8K10KSE +/- 34.60, N = 3SE +/- 38.60, N = 37895.18752.51. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 1024Intel Core i3-10100Intel Core i5-10600K16003200480064008000SE +/- 6.50, N = 3SE +/- 83.43, N = 36657.97477.31. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 2048Intel Core i3-10100Intel Core i5-10600K14002800420056007000SE +/- 18.15, N = 3SE +/- 11.05, N = 36258.66548.71. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096Intel Core i3-10100Intel Core i5-10600K14002800420056007000SE +/- 3.56, N = 3SE +/- 28.65, N = 35964.16334.21. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 32Intel Core i3-10100Intel Core i5-10600K4K8K12K16K20KSE +/- 30.85, N = 3SE +/- 278.43, N = 316975184421. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 64Intel Core i3-10100Intel Core i5-10600K5K10K15K20K25KSE +/- 204.70, N = 3SE +/- 243.60, N = 821112233001. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 32Intel Core i3-10100Intel Core i5-10600K12K24K36K48K60KSE +/- 315.98, N = 3SE +/- 219.30, N = 349207551991. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 64Intel Core i3-10100Intel Core i5-10600K11K22K33K44K55KSE +/- 600.09, N = 3SE +/- 661.09, N = 343885492211. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 128Intel Core i3-10100Intel Core i5-10600K6K12K18K24K30KSE +/- 248.24, N = 3SE +/- 349.46, N = 325205279021. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 256Intel Core i3-10100Intel Core i5-10600K8K16K24K32K40KSE +/- 197.03, N = 3SE +/- 465.25, N = 335177388661. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 512Intel Core i3-10100Intel Core i5-10600K10K20K30K40K50KSE +/- 88.21, N = 3SE +/- 212.58, N = 344105490091. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 128Intel Core i3-10100Intel Core i5-10600K8K16K24K32K40KSE +/- 462.07, N = 4SE +/- 493.36, N = 333950384201. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 256Intel Core i3-10100Intel Core i5-10600K8K16K24K32K40KSE +/- 80.84, N = 3SE +/- 47.79, N = 332998365221. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 512Intel Core i3-10100Intel Core i5-10600K8K16K24K32K40KSE +/- 33.93, N = 3SE +/- 165.77, N = 334677387091. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 1024Intel Core i3-10100Intel Core i5-10600K11K22K33K44K55KSE +/- 219.08, N = 3SE +/- 58.54, N = 347510533141. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 2048Intel Core i3-10100Intel Core i5-10600K11K22K33K44K55KSE +/- 63.72, N = 3SE +/- 514.62, N = 347673535061. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 4096Intel Core i3-10100Intel Core i5-10600K11K22K33K44K55KSE +/- 574.55, N = 5SE +/- 510.41, N = 344598494931. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 1024Intel Core i3-10100Intel Core i5-10600K8K16K24K32K40KSE +/- 132.83, N = 3SE +/- 111.44, N = 329909376981. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 2048Intel Core i3-10100Intel Core i5-10600K6K12K18K24K30KSE +/- 191.43, N = 3SE +/- 130.95, N = 326279260531. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096Intel Core i3-10100Intel Core i5-10600K5K10K15K20K25KSE +/- 129.05, N = 3SE +/- 191.44, N = 324354227671. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

oneDNN MKL-DNN

Harness: IP Batch 1D - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetteroneDNN MKL-DNN 1.3Harness: IP Batch 1D - Data Type: f32Intel Core i3-10100Intel Core i5-10600K246810SE +/- 0.02171, N = 3SE +/- 0.02093, N = 37.322334.54913MIN: 6.83MIN: 4.441. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

oneDNN MKL-DNN

Harness: IP Batch All - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetteroneDNN MKL-DNN 1.3Harness: IP Batch All - Data Type: f32Intel Core i3-10100Intel Core i5-10600K20406080100SE +/- 0.11, N = 3SE +/- 0.06, N = 3101.9274.62MIN: 100.36MIN: 73.651. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

oneDNN MKL-DNN

Harness: IP Batch 1D - Data Type: u8s8f32

OpenBenchmarking.orgms, Fewer Is BetteroneDNN MKL-DNN 1.3Harness: IP Batch 1D - Data Type: u8s8f32Intel Core i3-10100Intel Core i5-10600K0.70571.41142.11712.82283.5285SE +/- 0.00313, N = 3SE +/- 0.00326, N = 33.136272.02339MIN: 3.11MIN: 1.971. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

oneDNN MKL-DNN

Harness: IP Batch All - Data Type: u8s8f32

OpenBenchmarking.orgms, Fewer Is BetteroneDNN MKL-DNN 1.3Harness: IP Batch All - Data Type: u8s8f32Intel Core i3-10100Intel Core i5-10600K1020304050SE +/- 0.07, N = 3SE +/- 0.01, N = 342.9430.06MIN: 42.52MIN: 29.711. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

oneDNN MKL-DNN

Harness: Deconvolution Batch deconv_1d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetteroneDNN MKL-DNN 1.3Harness: Deconvolution Batch deconv_1d - Data Type: f32Intel Core i3-10100Intel Core i5-10600K246810SE +/- 0.01195, N = 3SE +/- 0.01618, N = 38.420085.46402MIN: 8.36MIN: 5.361. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

oneDNN MKL-DNN

Harness: Deconvolution Batch deconv_3d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetteroneDNN MKL-DNN 1.3Harness: Deconvolution Batch deconv_3d - Data Type: f32Intel Core i3-10100Intel Core i5-10600K3691215SE +/- 0.00097, N = 3SE +/- 0.00750, N = 312.516908.48212MIN: 12.34MIN: 8.371. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

oneDNN MKL-DNN

Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32

OpenBenchmarking.orgms, Fewer Is BetteroneDNN MKL-DNN 1.3Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32Intel Core i3-10100Intel Core i5-10600K70140210280350SE +/- 3.88, N = 3SE +/- 3.25, N = 3318.86222.39MIN: 313.59MIN: 216.651. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

oneDNN MKL-DNN

Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32

OpenBenchmarking.orgms, Fewer Is BetteroneDNN MKL-DNN 1.3Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32Intel Core i3-10100Intel Core i5-10600K246810SE +/- 0.00655, N = 3SE +/- 0.00447, N = 36.470824.41247MIN: 6.43MIN: 4.371. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

oneDNN MKL-DNN

Harness: Recurrent Neural Network Training - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetteroneDNN MKL-DNN 1.3Harness: Recurrent Neural Network Training - Data Type: f32Intel Core i3-10100Intel Core i5-10600K90180270360450SE +/- 0.32, N = 3SE +/- 0.49, N = 3428.08285.86MIN: 426.8MIN: 283.71. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

oneDNN MKL-DNN

Harness: Recurrent Neural Network Inference - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetteroneDNN MKL-DNN 1.3Harness: Recurrent Neural Network Inference - Data Type: f32Intel Core i3-10100Intel Core i5-10600K306090120150SE +/- 0.25, N = 3SE +/- 0.22, N = 3113.1452.03MIN: 112.24MIN: 50.71. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl


Phoronix Test Suite v10.8.4