NKL DNN UPDATED Intel Core i7-8086K testing with a ASUS PRIME Z370-A (1802 BIOS) and Intel UHD 630 3GB on Ubuntu 18.10 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 1904192-PTS-NKLDNNUP40 Intel Core i7-8086K Processor: Intel Core i7-8086K @ 5.00GHz (6 Cores / 12 Threads), Motherboard: ASUS PRIME Z370-A (1802 BIOS), Chipset: Intel 8th Gen Core, Memory: 8192MB, Disk: 118GB INTEL SSDPEK1W120GA, Graphics: Intel UHD 630 3GB (1200MHz), Audio: Realtek ALC1220, Monitor: DELL P2415Q, Network: Intel I219-V
OS: Ubuntu 18.10, Kernel: 4.18.0-17-generic (x86_64), Desktop: GNOME Shell 3.30.2, Display Server: X Server 1.20.1, Display Driver: modesetting 1.20.1, OpenGL: 4.5 Mesa 18.2.2, Vulkan: 1.1.80, Compiler: GCC 8.2.0, File-System: ext4, Screen Resolution: 3840x2160
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersaveSecurity Notes: KPTI + __user pointer sanitization + Full generic retpoline IBPB IBRS_FW STIBP + SSB disabled via prctl and seccomp + PTE Inversion
NKL DNN UPDATED OpenBenchmarking.org Phoronix Test Suite Intel Core i7-8086K @ 5.00GHz (6 Cores / 12 Threads) ASUS PRIME Z370-A (1802 BIOS) Intel 8th Gen Core 8192MB 118GB INTEL SSDPEK1W120GA Intel UHD 630 3GB (1200MHz) Realtek ALC1220 DELL P2415Q Intel I219-V Ubuntu 18.10 4.18.0-17-generic (x86_64) GNOME Shell 3.30.2 X Server 1.20.1 modesetting 1.20.1 4.5 Mesa 18.2.2 1.1.80 GCC 8.2.0 ext4 3840x2160 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution NKL DNN UPDATED Benchmarks System Logs - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate powersave - KPTI + __user pointer sanitization + Full generic retpoline IBPB IBRS_FW STIBP + SSB disabled via prctl and seccomp + PTE Inversion
NKL DNN UPDATED mkl-dnn: IP Batch 1D - f32 mkl-dnn: IP Batch All - f32 mkl-dnn: IP Batch 1D - u8s8u8s32 mkl-dnn: IP Batch 1D - u8s8f32s32 mkl-dnn: IP Batch All - u8s8u8s32 mkl-dnn: IP Batch All - u8s8f32s32 mkl-dnn: Convolution Batch conv_3d - f32 mkl-dnn: Convolution Batch conv_all - f32 mkl-dnn: Deconvolution Batch deconv_1d - f32 mkl-dnn: Deconvolution Batch deconv_3d - f32 mkl-dnn: Convolution Batch conv_alexnet - f32 mkl-dnn: Deconvolution Batch deconv_all - f32 mkl-dnn: Convolution Batch conv_3d - u8s8u8s32 mkl-dnn: Convolution Batch conv_3d - u8s8f32s32 mkl-dnn: Convolution Batch conv_all - u8s8u8s32 mkl-dnn: Convolution Batch conv_all - u8s8f32s32 mkl-dnn: Convolution Batch conv_googlenet_v3 - f32 mkl-dnn: Deconvolution Batch deconv_1d - u8s8u8s32 mkl-dnn: Deconvolution Batch deconv_3d - u8s8u8s32 mkl-dnn: Convolution Batch conv_alexnet - u8s8u8s32 mkl-dnn: Deconvolution Batch deconv_1d - u8s8f32s32 mkl-dnn: Deconvolution Batch deconv_3d - u8s8f32s32 mkl-dnn: Deconvolution Batch deconv_all - u8s8u8s32 mkl-dnn: Convolution Batch conv_alexnet - u8s8f32s32 mkl-dnn: Convolution Batch conv_googlenet_v3 - u8s8u8s32 mkl-dnn: Convolution Batch conv_googlenet_v3 - u8s8f32s32 Intel Core i7-8086K 10.12 139.89 6.43 6.43 77.92 76.93 25.32 3419.20 6.99 8.75 447.67 4061.01 23974.07 23972.13 26742.97 26346.83 191.66 8431.36 13754.17 978.27 8238.11 13484.60 25675 934.86 446.32 422.18 OpenBenchmarking.org
MKL-DNN This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: IP Batch 1D - Data Type: f32 Intel Core i7-8086K 3 6 9 12 15 SE +/- 0.10, N = 3 10.12 MIN: 5.74 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: IP Batch All - Data Type: f32 Intel Core i7-8086K 30 60 90 120 150 SE +/- 1.41, N = 3 139.89 MIN: 84.39 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: IP Batch 1D - Data Type: u8s8u8s32 Intel Core i7-8086K 2 4 6 8 10 SE +/- 0.04, N = 3 6.43 MIN: 3.49 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: IP Batch 1D - Data Type: u8s8f32s32 Intel Core i7-8086K 2 4 6 8 10 SE +/- 0.04, N = 3 6.43 MIN: 3.42 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: IP Batch All - Data Type: u8s8u8s32 Intel Core i7-8086K 20 40 60 80 100 SE +/- 0.18, N = 3 77.92 MIN: 42.22 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: IP Batch All - Data Type: u8s8f32s32 Intel Core i7-8086K 20 40 60 80 100 SE +/- 0.80, N = 3 76.93 MIN: 41.82 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_3d - Data Type: f32 Intel Core i7-8086K 6 12 18 24 30 SE +/- 0.01, N = 3 25.32 MIN: 25.11 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_all - Data Type: f32 Intel Core i7-8086K 700 1400 2100 2800 3500 SE +/- 0.31, N = 3 3419.20 MIN: 3409.78 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_1d - Data Type: f32 Intel Core i7-8086K 2 4 6 8 10 SE +/- 0.02, N = 3 6.99 MIN: 6.82 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_3d - Data Type: f32 Intel Core i7-8086K 2 4 6 8 10 SE +/- 0.01, N = 3 8.75 MIN: 8.7 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_alexnet - Data Type: f32 Intel Core i7-8086K 100 200 300 400 500 SE +/- 1.07, N = 3 447.67 MIN: 445.63 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_all - Data Type: f32 Intel Core i7-8086K 900 1800 2700 3600 4500 SE +/- 10.01, N = 3 4061.01 MIN: 3789.48 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_3d - Data Type: u8s8u8s32 Intel Core i7-8086K 5K 10K 15K 20K 25K SE +/- 10.94, N = 3 23974.07 MIN: 23869.1 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_3d - Data Type: u8s8f32s32 Intel Core i7-8086K 5K 10K 15K 20K 25K SE +/- 7.71, N = 3 23972.13 MIN: 23867.6 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_all - Data Type: u8s8u8s32 Intel Core i7-8086K 6K 12K 18K 24K 30K SE +/- 7.16, N = 3 26742.97 MIN: 26118.8 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_all - Data Type: u8s8f32s32 Intel Core i7-8086K 6K 12K 18K 24K 30K SE +/- 16.85, N = 3 26346.83 MIN: 25778.1 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32 Intel Core i7-8086K 40 80 120 160 200 SE +/- 0.09, N = 3 191.66 MIN: 190.48 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_1d - Data Type: u8s8u8s32 Intel Core i7-8086K 2K 4K 6K 8K 10K SE +/- 0.93, N = 3 8431.36 MIN: 8404.01 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_3d - Data Type: u8s8u8s32 Intel Core i7-8086K 3K 6K 9K 12K 15K SE +/- 3.03, N = 3 13754.17 MIN: 13725.5 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_alexnet - Data Type: u8s8u8s32 Intel Core i7-8086K 200 400 600 800 1000 SE +/- 9.64, N = 15 978.27 MIN: 870.31 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32s32 Intel Core i7-8086K 2K 4K 6K 8K 10K SE +/- 4.77, N = 3 8238.11 MIN: 8206.65 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32s32 Intel Core i7-8086K 3K 6K 9K 12K 15K SE +/- 9.28, N = 3 13484.60 MIN: 13453.6 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_all - Data Type: u8s8u8s32 Intel Core i7-8086K 5K 10K 15K 20K 25K SE +/- 28.36, N = 3 25675 MIN: 25204.7 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_alexnet - Data Type: u8s8f32s32 Intel Core i7-8086K 200 400 600 800 1000 SE +/- 10.58, N = 3 934.86 MIN: 844.94 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_googlenet_v3 - Data Type: u8s8u8s32 Intel Core i7-8086K 100 200 300 400 500 SE +/- 1.82, N = 3 446.32 MIN: 385.09 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_googlenet_v3 - Data Type: u8s8f32s32 Intel Core i7-8086K 90 180 270 360 450 SE +/- 0.70, N = 3 422.18 MIN: 364.37 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
Intel Core i7-8086K Processor: Intel Core i7-8086K @ 5.00GHz (6 Cores / 12 Threads), Motherboard: ASUS PRIME Z370-A (1802 BIOS), Chipset: Intel 8th Gen Core, Memory: 8192MB, Disk: 118GB INTEL SSDPEK1W120GA, Graphics: Intel UHD 630 3GB (1200MHz), Audio: Realtek ALC1220, Monitor: DELL P2415Q, Network: Intel I219-V
OS: Ubuntu 18.10, Kernel: 4.18.0-17-generic (x86_64), Desktop: GNOME Shell 3.30.2, Display Server: X Server 1.20.1, Display Driver: modesetting 1.20.1, OpenGL: 4.5 Mesa 18.2.2, Vulkan: 1.1.80, Compiler: GCC 8.2.0, File-System: ext4, Screen Resolution: 3840x2160
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersaveSecurity Notes: KPTI + __user pointer sanitization + Full generic retpoline IBPB IBRS_FW STIBP + SSB disabled via prctl and seccomp + PTE Inversion
Testing initiated at 18 April 2019 21:51 by user pts.