MKL-DNN Xeon v6
Intel Xeon E3-1275 v6 testing with an ASUS P10S-M WS (4401 BIOS) and Intel HD P630 3GB on Ubuntu 18.04 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 1910029-AS-MKLDNNXEO21
Intel Xeon E3-1275 v6
Processor: Intel Xeon E3-1275 v6 @ 4.20GHz (4 Cores / 8 Threads), Motherboard: ASUS P10S-M WS (4401 BIOS), Chipset: Intel Xeon E3-1200 v6/7th, Memory: 16384MB, Disk: Samsung SSD 970 EVO Plus 500GB, Graphics: Intel HD P630 3GB (1150MHz), Audio: Realtek ALC1150, Monitor: VA2431, Network: 2 x Intel I210
OS: Ubuntu 18.04, Kernel: 4.18.0-25-generic (x86_64), Desktop: GNOME Shell 3.28.3, Display Server: X Server 1.20.1, Display Driver: modesetting 1.20.1, OpenGL: 4.5 Mesa 18.2.8, Compiler: GCC 7.4.0, File-System: ext4, Screen Resolution: 1920x1080
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate powersave
Security Notes: l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling
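The Security Notes field packs six mitigation entries into one string, separated by " + ", with each entry written as "name: status". A minimal Python sketch of splitting that string into a lookup table follows. The helper name parse_security_notes is hypothetical and not part of the Phoronix Test Suite; the input string is copied from the notes above.

```python
# Security Notes string copied from this result file.
SECURITY_NOTES = (
    "l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable"
    " + mds: Mitigation of Clear buffers; SMT vulnerable"
    " + meltdown: Mitigation of PTI"
    " + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp"
    " + spectre_v1: Mitigation of __user pointer sanitization"
    " + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional"
    " IBRS_FW STIBP: conditional RSB filling"
)


def parse_security_notes(notes):
    """Hypothetical helper: map each vulnerability name to its status text.

    Entries are separated by ' + '; within an entry, the name ends at the
    first ': ' and everything after it is kept as the status.
    """
    status = {}
    for entry in notes.split(" + "):
        name, _, detail = entry.partition(": ")
        status[name.strip()] = detail.strip()
    return status


if __name__ == "__main__":
    for name, detail in parse_security_notes(SECURITY_NOTES).items():
        print(f"{name:18s} {detail}")
```

Splitting only on the first ": " matters because several status texts (for example the spectre_v2 entry) contain further colons of their own.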
MKL-DNN Xeon v6 results overview, Intel Xeon E3-1275 v6 (ms)
mkl-dnn: IP Batch 1D - f32: 11.04
mkl-dnn: IP Batch All - f32: 174.51
mkl-dnn: IP Batch 1D - u8s8u8s32: 6.78
mkl-dnn: IP Batch 1D - u8s8f32s32: 6.68
mkl-dnn: IP Batch All - u8s8u8s32: 110.33
mkl-dnn: IP Batch All - u8s8f32s32: 111.68
mkl-dnn: Convolution Batch conv_3d - f32: 35.64
mkl-dnn: Convolution Batch conv_all - f32: 5379.77
mkl-dnn: Deconvolution Batch deconv_1d - f32: 10.99
mkl-dnn: Deconvolution Batch deconv_3d - f32: 12.97
mkl-dnn: Convolution Batch conv_alexnet - f32: 705.89
mkl-dnn: Deconvolution Batch deconv_all - f32: 4430.39
mkl-dnn: Convolution Batch conv_3d - u8s8u8s32: 26419.03
mkl-dnn: Convolution Batch conv_3d - u8s8f32s32: 26492.73
mkl-dnn: Convolution Batch conv_all - u8s8u8s32: 31922.03
mkl-dnn: Convolution Batch conv_all - u8s8f32s32: 30908.37
mkl-dnn: Convolution Batch conv_googlenet_v3 - f32: 300.93
mkl-dnn: Deconvolution Batch deconv_1d - u8s8u8s32: 12960.33
mkl-dnn: Deconvolution Batch deconv_3d - u8s8u8s32: 21667.30
mkl-dnn: Convolution Batch conv_alexnet - u8s8u8s32: 1154.58
mkl-dnn: Deconvolution Batch deconv_1d - u8s8f32s32: 13295.77
mkl-dnn: Deconvolution Batch deconv_3d - u8s8f32s32: 21979.67
mkl-dnn: Deconvolution Batch deconv_all - u8s8u8s32: 39074.47
mkl-dnn: Convolution Batch conv_alexnet - u8s8f32s32: 1038.75
mkl-dnn: Convolution Batch conv_googlenet_v3 - u8s8u8s32: 632.64
mkl-dnn: Convolution Batch conv_googlenet_v3 - u8s8f32s32: 550.57
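Because every harness was run with an f32 data type and at least one integer data type, the overview can be reduced to one ratio per integer run. The Python sketch below divides each u8s8u8s32 or u8s8f32s32 time by the f32 time of the same harness; a ratio above 1 means the integer run took longer on this system. The values are copied from the overview, while the dictionary layout and the function name int8_vs_f32 are illustrative assumptions, not part of the result file.

```python
# Average times in ms copied from the results overview of this result file.
RESULTS_MS = {
    ("IP Batch 1D", "f32"): 11.04,
    ("IP Batch All", "f32"): 174.51,
    ("IP Batch 1D", "u8s8u8s32"): 6.78,
    ("IP Batch 1D", "u8s8f32s32"): 6.68,
    ("IP Batch All", "u8s8u8s32"): 110.33,
    ("IP Batch All", "u8s8f32s32"): 111.68,
    ("Convolution Batch conv_3d", "f32"): 35.64,
    ("Convolution Batch conv_all", "f32"): 5379.77,
    ("Deconvolution Batch deconv_1d", "f32"): 10.99,
    ("Deconvolution Batch deconv_3d", "f32"): 12.97,
    ("Convolution Batch conv_alexnet", "f32"): 705.89,
    ("Deconvolution Batch deconv_all", "f32"): 4430.39,
    ("Convolution Batch conv_3d", "u8s8u8s32"): 26419.03,
    ("Convolution Batch conv_3d", "u8s8f32s32"): 26492.73,
    ("Convolution Batch conv_all", "u8s8u8s32"): 31922.03,
    ("Convolution Batch conv_all", "u8s8f32s32"): 30908.37,
    ("Convolution Batch conv_googlenet_v3", "f32"): 300.93,
    ("Deconvolution Batch deconv_1d", "u8s8u8s32"): 12960.33,
    ("Deconvolution Batch deconv_3d", "u8s8u8s32"): 21667.30,
    ("Convolution Batch conv_alexnet", "u8s8u8s32"): 1154.58,
    ("Deconvolution Batch deconv_1d", "u8s8f32s32"): 13295.77,
    ("Deconvolution Batch deconv_3d", "u8s8f32s32"): 21979.67,
    ("Deconvolution Batch deconv_all", "u8s8u8s32"): 39074.47,
    ("Convolution Batch conv_alexnet", "u8s8f32s32"): 1038.75,
    ("Convolution Batch conv_googlenet_v3", "u8s8u8s32"): 632.64,
    ("Convolution Batch conv_googlenet_v3", "u8s8f32s32"): 550.57,
}


def int8_vs_f32(results):
    """Illustrative helper: integer-run time divided by the f32 time
    of the same harness."""
    ratios = {}
    for (harness, dtype), ms in results.items():
        if dtype == "f32":
            continue
        ratios[(harness, dtype)] = ms / results[(harness, "f32")]
    return ratios


if __name__ == "__main__":
    ranked = sorted(int8_vs_f32(RESULTS_MS).items(), key=lambda kv: kv[1])
    for (harness, dtype), ratio in ranked:
        print(f"{harness:38s} {dtype:11s} {ratio:9.2f}x")
```

The ratios describe only this run on this system; they say nothing about why a given data type was faster or slower.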
MKL-DNN
This is a test of Intel MKL-DNN, the Intel Math Kernel Library for Deep Neural Networks, using its built-in benchdnn functionality. The result reported is the total perf time. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org, MKL-DNN 2019-04-16, Intel Xeon E3-1275 v6. All results in ms, fewer is better.
1. (CXX) g++ options (all results): -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
Harness: IP Batch 1D - Data Type: f32: 11.04 (SE +/- 0.14, N = 15, MIN: 8.84)
Harness: IP Batch All - Data Type: f32: 174.51 (SE +/- 1.64, N = 15, MIN: 112.58)
Harness: IP Batch 1D - Data Type: u8s8u8s32: 6.78 (SE +/- 0.08, N = 15, MIN: 5.53)
Harness: IP Batch 1D - Data Type: u8s8f32s32: 6.68 (SE +/- 0.07, N = 15, MIN: 5.43)
Harness: IP Batch All - Data Type: u8s8u8s32: 110.33 (SE +/- 0.61, N = 3, MIN: 66.64)
Harness: IP Batch All - Data Type: u8s8f32s32: 111.68 (SE +/- 0.78, N = 3, MIN: 65.7)
Harness: Convolution Batch conv_3d - Data Type: f32: 35.64 (SE +/- 0.07, N = 3, MIN: 35.27)
Harness: Convolution Batch conv_all - Data Type: f32: 5379.77 (SE +/- 0.61, N = 3, MIN: 5367.58)
Harness: Deconvolution Batch deconv_1d - Data Type: f32: 10.99 (SE +/- 0.03, N = 3, MIN: 10.84)
Harness: Deconvolution Batch deconv_3d - Data Type: f32: 12.97 (SE +/- 0.01, N = 3, MIN: 12.76)
Harness: Convolution Batch conv_alexnet - Data Type: f32: 705.89 (SE +/- 0.64, N = 3, MIN: 702.61)
Harness: Deconvolution Batch deconv_all - Data Type: f32: 4430.39 (SE +/- 1.84, N = 3, MIN: 4393.25)
Harness: Convolution Batch conv_3d - Data Type: u8s8u8s32: 26419.03 (SE +/- 49.00, N = 3, MIN: 26356.1)
Harness: Convolution Batch conv_3d - Data Type: u8s8f32s32: 26492.73 (SE +/- 7.59, N = 3, MIN: 26469.4)
Harness: Convolution Batch conv_all - Data Type: u8s8u8s32: 31922.03 (SE +/- 9.20, N = 3, MIN: 31504.3)
Harness: Convolution Batch conv_all - Data Type: u8s8f32s32: 30908.37 (SE +/- 47.28, N = 3, MIN: 30490.3)
Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32: 300.93 (SE +/- 0.15, N = 3, MIN: 299.48)
Harness: Deconvolution Batch deconv_1d - Data Type: u8s8u8s32: 12960.33 (SE +/- 4.83, N = 3, MIN: 12947.8)
Harness: Deconvolution Batch deconv_3d - Data Type: u8s8u8s32: 21667.30 (SE +/- 7.54, N = 3, MIN: 21640.3)
Harness: Convolution Batch conv_alexnet - Data Type: u8s8u8s32: 1154.58 (SE +/- 0.83, N = 3, MIN: 1088.88)
Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32s32: 13295.77 (SE +/- 11.12, N = 3, MIN: 13272.3)
Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32s32: 21979.67 (SE +/- 5.01, N = 3, MIN: 21965.1)
Harness: Deconvolution Batch deconv_all - Data Type: u8s8u8s32: 39074.47 (SE +/- 22.10, N = 3, MIN: 38593)
Harness: Convolution Batch conv_alexnet - Data Type: u8s8f32s32: 1038.75 (SE +/- 10.05, N = 3, MIN: 994.01)
Harness: Convolution Batch conv_googlenet_v3 - Data Type: u8s8u8s32: 632.64 (SE +/- 2.59, N = 3, MIN: 591.17)
Harness: Convolution Batch conv_googlenet_v3 - Data Type: u8s8f32s32: 550.57 (SE +/- 2.56, N = 3, MIN: 513.87)
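Each result carries an "SE +/-" figure and a run count N. Assuming SE denotes the standard error of the mean (sample standard deviation divided by the square root of N), which the result file itself does not spell out, the run-to-run spread can be recovered as in the Python sketch below. The function names are illustrative; the example numbers are those of the IP Batch 1D f32 result (11.04 ms, SE +/- 0.14, N = 15).

```python
import math


def std_dev_from_se(se, n):
    """Sample standard deviation implied by a standard error of the mean.

    Assumes SE = s / sqrt(N), so s = SE * sqrt(N).
    """
    return se * math.sqrt(n)


def relative_se_percent(se, mean):
    """Standard error expressed as a percentage of the reported average."""
    return 100.0 * se / mean


if __name__ == "__main__":
    # IP Batch 1D - f32: 11.04 ms, SE +/- 0.14, N = 15
    mean_ms, se_ms, runs = 11.04, 0.14, 15
    print(f"implied std dev: {std_dev_from_se(se_ms, runs):.3f} ms")
    print(f"relative SE:     {relative_se_percent(se_ms, mean_ms):.2f} %")
```

Comparing the relative SE across results is one way to judge which averages are tight and which are noisier, bearing in mind that most results here rest on only N = 3 runs.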
Testing initiated at 1 October 2019 18:08 by user phoronix.