MKL-DNN Cascade Lake 2 x Intel Xeon Platinum 8280 testing with a GIGABYTE MD61-SC2-00 v01000100 (T15 BIOS) and ASPEED on Ubuntu 19.04 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 1905231-HV-MKLDNNCAS40 2 x Intel Xeon Platinum 8280 Processor: 2 x Intel Xeon Platinum 8280 @ 4.00GHz (56 Cores / 112 Threads), Motherboard: GIGABYTE MD61-SC2-00 v01000100 (T15 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 386048MB, Disk: Samsung SSD 970 PRO 512GB, Graphics: ASPEED, Monitor: VE228, Network: 2 x Intel X722 for 1GbE + 2 x QLogic FastLinQ QL41000 10/25/40/50GbE
OS: Ubuntu 19.04, Kernel: 5.0.0-15-generic (x86_64), Desktop: GNOME Shell 3.32.0, Display Server: X Server 1.20.4, Display Driver: modesetting 1.20.4, Compiler: GCC 8.3.0, File-System: ext4, Screen Resolution: 1920x1080
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersaveSecurity Notes: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling
MKL-DNN Cascade Lake OpenBenchmarking.org Phoronix Test Suite 2 x Intel Xeon Platinum 8280 @ 4.00GHz (56 Cores / 112 Threads) GIGABYTE MD61-SC2-00 v01000100 (T15 BIOS) Intel Sky Lake-E DMI3 Registers 386048MB Samsung SSD 970 PRO 512GB ASPEED VE228 2 x Intel X722 for 1GbE + 2 x QLogic FastLinQ QL41000 10/25/40/50GbE Ubuntu 19.04 5.0.0-15-generic (x86_64) GNOME Shell 3.32.0 X Server 1.20.4 modesetting 1.20.4 GCC 8.3.0 ext4 1920x1080 Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Server Display Driver Compiler File-System Screen Resolution MKL-DNN Cascade Lake Benchmarks System Logs - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate powersave - l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling
MKL-DNN Cascade Lake mkl-dnn: Convolution Batch conv_all - u8s8u8s32 mkl-dnn: Convolution Batch conv_all - u8s8f32s32 mkl-dnn: Convolution Batch conv_all - f32 mkl-dnn: Deconvolution Batch deconv_all - u8s8u8s32 mkl-dnn: Deconvolution Batch deconv_all - f32 mkl-dnn: Convolution Batch conv_googlenet_v3 - u8s8u8s32 mkl-dnn: Convolution Batch conv_googlenet_v3 - u8s8f32s32 mkl-dnn: Convolution Batch conv_googlenet_v3 - f32 mkl-dnn: IP Batch 1D - u8s8u8s32 mkl-dnn: Convolution Batch conv_3d - u8s8u8s32 mkl-dnn: Convolution Batch conv_3d - u8s8f32s32 mkl-dnn: IP Batch All - f32 mkl-dnn: IP Batch All - u8s8f32s32 mkl-dnn: IP Batch All - u8s8u8s32 mkl-dnn: Convolution Batch conv_3d - f32 mkl-dnn: IP Batch 1D - f32 mkl-dnn: Deconvolution Batch deconv_1d - u8s8u8s32 mkl-dnn: Deconvolution Batch deconv_1d - u8s8f32s32 mkl-dnn: Deconvolution Batch deconv_1d - f32 mkl-dnn: Convolution Batch conv_alexnet - u8s8u8s32 mkl-dnn: Convolution Batch conv_alexnet - u8s8f32s32 mkl-dnn: Convolution Batch conv_alexnet - f32 mkl-dnn: IP Batch 1D - u8s8f32s32 mkl-dnn: Deconvolution Batch deconv_3d - u8s8u8s32 mkl-dnn: Deconvolution Batch deconv_3d - u8s8f32s32 mkl-dnn: Deconvolution Batch deconv_3d - f32 2 x Intel Xeon Platinum 8280 1716.72 1754.68 388.65 3670.07 1035.47 5.80 6.23 21.60 3.53 3547.34 3536.77 98.64 20.15 20.20 3.32 11.75 0.24 0.23 0.96 14.38 19.03 48.89 3.62 2040.83 1891.74 1.11 OpenBenchmarking.org
MKL-DNN This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_all - Data Type: u8s8u8s32 2 x Intel Xeon Platinum 8280 400 800 1200 1600 2000 SE +/- 1.44, N = 3 1716.72 MIN: 1710.78 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_all - Data Type: u8s8f32s32 2 x Intel Xeon Platinum 8280 400 800 1200 1600 2000 SE +/- 7.76, N = 3 1754.68 MIN: 1740.17 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_all - Data Type: f32 2 x Intel Xeon Platinum 8280 80 160 240 320 400 SE +/- 1.18, N = 3 388.65 MIN: 375.32 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_all - Data Type: u8s8u8s32 2 x Intel Xeon Platinum 8280 800 1600 2400 3200 4000 SE +/- 5.87, N = 3 3670.07 MIN: 3608.7 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_all - Data Type: f32 2 x Intel Xeon Platinum 8280 200 400 600 800 1000 SE +/- 16.39, N = 3 1035.47 MIN: 962.18 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_googlenet_v3 - Data Type: u8s8u8s32 2 x Intel Xeon Platinum 8280 1.305 2.61 3.915 5.22 6.525 SE +/- 0.01, N = 3 5.80 MIN: 5.52 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_googlenet_v3 - Data Type: u8s8f32s32 2 x Intel Xeon Platinum 8280 2 4 6 8 10 SE +/- 0.03, N = 3 6.23 MIN: 5.88 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32 2 x Intel Xeon Platinum 8280 5 10 15 20 25 SE +/- 0.20, N = 3 21.60 MIN: 20.73 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: IP Batch 1D - Data Type: u8s8u8s32 2 x Intel Xeon Platinum 8280 0.7943 1.5886 2.3829 3.1772 3.9715 SE +/- 0.04, N = 15 3.53 MIN: 2.15 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_3d - Data Type: u8s8u8s32 2 x Intel Xeon Platinum 8280 800 1600 2400 3200 4000 SE +/- 7.10, N = 3 3547.34 MIN: 3524.36 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_3d - Data Type: u8s8f32s32 2 x Intel Xeon Platinum 8280 800 1600 2400 3200 4000 SE +/- 6.90, N = 3 3536.77 MIN: 3520.75 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: IP Batch All - Data Type: f32 2 x Intel Xeon Platinum 8280 20 40 60 80 100 SE +/- 0.74, N = 3 98.64 MIN: 63.99 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: IP Batch All - Data Type: u8s8f32s32 2 x Intel Xeon Platinum 8280 5 10 15 20 25 SE +/- 0.07, N = 3 20.15 MIN: 12.53 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: IP Batch All - Data Type: u8s8u8s32 2 x Intel Xeon Platinum 8280 5 10 15 20 25 SE +/- 0.05, N = 3 20.20 MIN: 13.43 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_3d - Data Type: f32 2 x Intel Xeon Platinum 8280 0.747 1.494 2.241 2.988 3.735 SE +/- 0.01, N = 3 3.32 MIN: 3.17 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: IP Batch 1D - Data Type: f32 2 x Intel Xeon Platinum 8280 3 6 9 12 15 SE +/- 0.14, N = 6 11.75 MIN: 6.61 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_1d - Data Type: u8s8u8s32 2 x Intel Xeon Platinum 8280 0.054 0.108 0.162 0.216 0.27 SE +/- 0.00, N = 3 0.24 MIN: 0.22 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32s32 2 x Intel Xeon Platinum 8280 0.0518 0.1036 0.1554 0.2072 0.259 SE +/- 0.00, N = 3 0.23 MIN: 0.21 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_1d - Data Type: f32 2 x Intel Xeon Platinum 8280 0.216 0.432 0.648 0.864 1.08 SE +/- 0.01, N = 3 0.96 MIN: 0.91 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_alexnet - Data Type: u8s8u8s32 2 x Intel Xeon Platinum 8280 4 8 12 16 20 SE +/- 0.12, N = 3 14.38 MIN: 13.92 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_alexnet - Data Type: u8s8f32s32 2 x Intel Xeon Platinum 8280 5 10 15 20 25 SE +/- 0.08, N = 3 19.03 MIN: 18.23 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_alexnet - Data Type: f32 2 x Intel Xeon Platinum 8280 11 22 33 44 55 SE +/- 0.59, N = 3 48.89 MIN: 47.42 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: IP Batch 1D - Data Type: u8s8f32s32 2 x Intel Xeon Platinum 8280 0.8145 1.629 2.4435 3.258 4.0725 SE +/- 0.03, N = 3 3.62 MIN: 2.32 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_3d - Data Type: u8s8u8s32 2 x Intel Xeon Platinum 8280 400 800 1200 1600 2000 SE +/- 1.92, N = 3 2040.83 MIN: 2036.48 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32s32 2 x Intel Xeon Platinum 8280 400 800 1200 1600 2000 SE +/- 0.72, N = 3 1891.74 MIN: 1886.82 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_3d - Data Type: f32 2 x Intel Xeon Platinum 8280 0.2498 0.4996 0.7494 0.9992 1.249 SE +/- 0.00, N = 3 1.11 MIN: 1.07 1. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl
2 x Intel Xeon Platinum 8280 Processor: 2 x Intel Xeon Platinum 8280 @ 4.00GHz (56 Cores / 112 Threads), Motherboard: GIGABYTE MD61-SC2-00 v01000100 (T15 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 386048MB, Disk: Samsung SSD 970 PRO 512GB, Graphics: ASPEED, Monitor: VE228, Network: 2 x Intel X722 for 1GbE + 2 x QLogic FastLinQ QL41000 10/25/40/50GbE
OS: Ubuntu 19.04, Kernel: 5.0.0-15-generic (x86_64), Desktop: GNOME Shell 3.32.0, Display Server: X Server 1.20.4, Display Driver: modesetting 1.20.4, Compiler: GCC 8.3.0, File-System: ext4, Screen Resolution: 1920x1080
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersaveSecurity Notes: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling
Testing initiated at 23 May 2019 15:06 by user phoronix.