onednn_centos7

2 x AMD EPYC 7642 48-Core testing with a Dell 0GK70M (1.5.5 BIOS) and Matrox G200eW3 on CentOS Linux 7 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2311161-NE-ONEDNNCEN98&grt.

onednn_centos7ProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelCompilerFile-SystemScreen Resolutiongator2 x AMD EPYC 7642 48-Core (96 Cores)Dell 0GK70M (1.5.5 BIOS)AMD Starship/Matisse256GB600GB PERC H345 FrontMatrox G200eW36 x Broadcom NetXtreme BCM5720 2-port PCIe + 2 x Solarflare SFC9120 10GCentOS Linux 73.10.0-1160.24.1.el7.x86_64 (x86_64)GCC 4.8.5 20150623xfs1024x768OpenBenchmarking.org- Transparent Huge Pages: madvise- --build=x86_64-redhat-linux --disable-libgcj --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,objc,obj-c++,java,fortran,ada,go,lto --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-linker-hash-style=gnu --with-tune=generic - CPU Microcode: 0x8301038

onednn_centos7onednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUgator3.433659.712077.031111.460651.6216412.83133.427993.698483.046851.538933540.141891.393545.631925.073608.801950.44OpenBenchmarking.org

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUgator0.77261.54522.31783.09043.863SE +/- 0.00291, N = 33.43365MIN: 2.821. (CXX) g++ options: -std=c++11 -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -rdynamic -lpthread -lrt -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUgator3691215SE +/- 0.01306, N = 39.71207MIN: 8.911. (CXX) g++ options: -std=c++11 -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -rdynamic -lpthread -lrt -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUgator246810SE +/- 0.63391, N = 157.03111MIN: 3.21. (CXX) g++ options: -std=c++11 -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -rdynamic -lpthread -lrt -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUgator0.32860.65720.98581.31441.643SE +/- 0.00550, N = 31.46065MIN: 1.141. (CXX) g++ options: -std=c++11 -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -rdynamic -lpthread -lrt -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUgator0.36490.72981.09471.45961.8245SE +/- 0.00901, N = 31.62164MIN: 1.31. (CXX) g++ options: -std=c++11 -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -rdynamic -lpthread -lrt -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUgator3691215SE +/- 0.06, N = 312.83MIN: 5.831. (CXX) g++ options: -std=c++11 -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -rdynamic -lpthread -lrt -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUgator0.77131.54262.31393.08523.8565SE +/- 0.02385, N = 153.42799MIN: 2.681. (CXX) g++ options: -std=c++11 -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -rdynamic -lpthread -lrt -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUgator0.83221.66442.49663.32884.161SE +/- 0.03332, N = 153.69848MIN: 2.211. (CXX) g++ options: -std=c++11 -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -rdynamic -lpthread -lrt -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUgator0.68551.3712.05652.7423.4275SE +/- 0.03418, N = 33.04685MIN: 2.171. (CXX) g++ options: -std=c++11 -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -rdynamic -lpthread -lrt -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUgator0.34630.69261.03891.38521.7315SE +/- 0.00407, N = 31.53893MIN: 1.451. (CXX) g++ options: -std=c++11 -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -rdynamic -lpthread -lrt -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUgator8001600240032004000SE +/- 36.98, N = 43540.14MIN: 3260.381. (CXX) g++ options: -std=c++11 -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -rdynamic -lpthread -lrt -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUgator400800120016002000SE +/- 17.40, N = 31891.39MIN: 1791.551. (CXX) g++ options: -std=c++11 -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -rdynamic -lpthread -lrt -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUgator8001600240032004000SE +/- 38.91, N = 53545.63MIN: 3308.921. (CXX) g++ options: -std=c++11 -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -rdynamic -lpthread -lrt -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUgator400800120016002000SE +/- 12.03, N = 31925.07MIN: 1821.141. (CXX) g++ options: -std=c++11 -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -rdynamic -lpthread -lrt -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUgator8001600240032004000SE +/- 44.07, N = 33608.80MIN: 3277.791. (CXX) g++ options: -std=c++11 -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -rdynamic -lpthread -lrt -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUgator400800120016002000SE +/- 26.77, N = 31950.44MIN: 1808.481. (CXX) g++ options: -std=c++11 -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -rdynamic -lpthread -lrt -ldl


Phoronix Test Suite v10.8.4