MKL-DNN DNNL AMD EPYC

2 x AMD EPYC 7601 32-Core testing with a Dell 02MJ3T (1.2.5 BIOS) and llvmpipe 504GB on Ubuntu 19.10 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1910046-AS-1910044AS29

Run Management

Result Identifier        Date Run           Test Duration
EPYC 7742 2P             October 03 2019    2 Hours, 4 Minutes
Xeon Platinum 8280 2P    October 04 2019    1 Hour, 55 Minutes
EPYC 7601 2P             October 04 2019    3 Hours, 32 Minutes
Average                                     2 Hours, 30 Minutes



MKL-DNN DNNL AMD EPYC: System Details

EPYC 7742 2P
  Processor: 2 x AMD EPYC 7742 64-Core @ 2.25GHz (128 Cores / 256 Threads)
  Motherboard: AMD DAYTONA_X (RDY1001C BIOS)
  Chipset: AMD Starship/Matisse
  Memory: 516096MB
  Disk: 280GB INTEL SSDPED1D280GA + 256GB Micron_1100_MTFD
  Graphics: llvmpipe 504GB
  Monitor: VE228
  Network: 2 x Mellanox MT27710
  OS: Ubuntu 19.10
  Kernel: 5.3.0-13-generic (x86_64)
  Desktop: GNOME Shell 3.34.0
  Display Server: X Server 1.20.5
  Display Driver: modesetting 1.20.5
  OpenGL: 3.3 Mesa 19.2.0 (LLVM 9.0 128 bits)
  Compiler: GCC 9.2.1 20190909
  File-System: ext4
  Screen Resolution: 1920x1080

Xeon Platinum 8280 2P (only the fields the source lists for this system)
  Processor: 2 x Intel Xeon Platinum 8280 @ 4.00GHz (56 Cores / 112 Threads)
  Motherboard: GIGABYTE MD61-SC2-00 v01000100 (T15 BIOS)
  Chipset: Intel Sky Lake-E DMI3 Registers
  Memory: 386048MB
  Disk: 280GB INTEL SSDPED1D280GA
  Graphics: llvmpipe 377GB
  Network: 2 x Intel X722 for 1GbE + 2 x QLogic FastLinQ QL41000 10/25/40/50GbE
  OpenGL: 3.3 Mesa 19.2.0 (LLVM 9.0 256 bits)

EPYC 7601 2P (only the fields the source lists for this system)
  Processor: 2 x AMD EPYC 7601 32-Core (64 Cores / 128 Threads)
  Motherboard: Dell 02MJ3T (1.2.5 BIOS)
  Chipset: AMD 17h
  Memory: 516096MB
  Disk: 280GB INTEL SSDPED1D280GA + 12 x 500GB Samsung SSD 860 + 120GB SSDSCKJB120G7R
  Graphics: llvmpipe 504GB
  Network: 2 x Broadcom BCM57416 NetXtreme-E Dual-Media 10G RDMA + 2 x Broadcom NetXtreme BCM5720 2-port PCIe
  OpenGL: 3.3 Mesa 19.2.0 (LLVM 9.0 128 bits)
  Screen Resolution: 1600x1200

Compiler Details
  - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
  - EPYC 7742 2P: Scaling Governor: acpi-cpufreq ondemand
  - Xeon Platinum 8280 2P: Scaling Governor: intel_pstate powersave

Security Details
  - EPYC 7742 2P: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling
  - Xeon Platinum 8280 2P: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling
  - EPYC 7601 2P: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling

[Figure: Logarithmic Result Overview (Phoronix Test Suite) comparing EPYC 7742 2P, Xeon Platinum 8280 2P, and EPYC 7601 2P across the MKL-DNN DNNL tests.]

MKL-DNN DNNL AMD EPYC: Result Table (mkl-dnn)

Test (ms, fewer is better)                     EPYC 7742 2P  Xeon Platinum 8280 2P  EPYC 7601 2P
IP Batch 1D - f32                              2.11          1.42                   4.90
IP Batch All - f32                             11.10         9.83                   47.83
IP Batch 1D - u8s8f32                          29.21         2.18                   40.53
IP Batch All - u8s8f32                         176.32        2.90                   270.54
Convolution Batch conv_3d - f32                4.32          4.70                   17.27
Convolution Batch conv_all - f32               403.50        494.94                 2040.28
Convolution Batch conv_3d - u8s8f32            1391.48       2989.04                2386.59
Deconvolution Batch deconv_1d - f32            2.89          1.24                   4.72
Deconvolution Batch deconv_3d - f32            3.25          1.15                   9.37
Convolution Batch conv_alexnet - f32           47.43         49.79                  192.49
Convolution Batch conv_all - u8s8f32           9100.13       1521.72                16846.88
Deconvolution Batch deconv_all - f32           837.06        877.84                 3470.77
Deconvolution Batch deconv_1d - u8s8f32        517.95        0.44                   900.06
Deconvolution Batch deconv_3d - u8s8f32        846.19        1881.87                1526.11
Recurrent Neural Network Training - f32        814.70        223.03                 709.12
Convolution Batch conv_alexnet - u8s8f32       540.52        19.51                  1179.54
Convolution Batch conv_googlenet_v3 - f32      29.38         23.08                  116.50
Convolution Batch conv_googlenet_v3 - u8s8f32  1580.49       7.68                   1673.36
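
The averages in this result file can also be compared programmatically. The following is a minimal Python sketch, not part of the Phoronix Test Suite: the names `SYSTEMS`, `RESULTS`, `count_wins`, and `geometric_mean_ratio` are hypothetical helpers introduced here for illustration. It counts how many tests each system finishes fastest and computes the geometric mean of per-test time ratios against a chosen baseline.

```python
from math import exp, log

SYSTEMS = ("EPYC 7742 2P", "Xeon Platinum 8280 2P", "EPYC 7601 2P")

# Average times in ms (fewer is better), copied from this result file.
# Tuple order matches SYSTEMS.
RESULTS = {
    "IP Batch 1D - f32": (2.11, 1.42, 4.90),
    "IP Batch All - f32": (11.10, 9.83, 47.83),
    "IP Batch 1D - u8s8f32": (29.21, 2.18, 40.53),
    "IP Batch All - u8s8f32": (176.32, 2.90, 270.54),
    "Convolution Batch conv_3d - f32": (4.32, 4.70, 17.27),
    "Convolution Batch conv_all - f32": (403.50, 494.94, 2040.28),
    "Convolution Batch conv_3d - u8s8f32": (1391.48, 2989.04, 2386.59),
    "Deconvolution Batch deconv_1d - f32": (2.89, 1.24, 4.72),
    "Deconvolution Batch deconv_3d - f32": (3.25, 1.15, 9.37),
    "Convolution Batch conv_alexnet - f32": (47.43, 49.79, 192.49),
    "Convolution Batch conv_all - u8s8f32": (9100.13, 1521.72, 16846.88),
    "Deconvolution Batch deconv_all - f32": (837.06, 877.84, 3470.77),
    "Deconvolution Batch deconv_1d - u8s8f32": (517.95, 0.44, 900.06),
    "Deconvolution Batch deconv_3d - u8s8f32": (846.19, 1881.87, 1526.11),
    "Recurrent Neural Network Training - f32": (814.70, 223.03, 709.12),
    "Convolution Batch conv_alexnet - u8s8f32": (540.52, 19.51, 1179.54),
    "Convolution Batch conv_googlenet_v3 - f32": (29.38, 23.08, 116.50),
    "Convolution Batch conv_googlenet_v3 - u8s8f32": (1580.49, 7.68, 1673.36),
}


def count_wins(results):
    """Count, per system, the tests in which it has the lowest time."""
    wins = {name: 0 for name in SYSTEMS}
    for times in results.values():
        wins[SYSTEMS[times.index(min(times))]] += 1
    return wins


def geometric_mean_ratio(results, system, baseline):
    """Geometric mean of time(system) / time(baseline) over all tests.

    A value above 1.0 means `system` is slower than `baseline` on average.
    """
    i, j = SYSTEMS.index(system), SYSTEMS.index(baseline)
    logs = [log(times[i] / times[j]) for times in results.values()]
    return exp(sum(logs) / len(logs))


if __name__ == "__main__":
    print(count_wins(RESULTS))
    for name in SYSTEMS:
        ratio = geometric_mean_ratio(RESULTS, name, "EPYC 7742 2P")
        print(f"{name}: {ratio:.2f}x the time of EPYC 7742 2P")
```

With the figures above, `count_wins` reports 12 wins for the Xeon Platinum 8280 2P, 6 for the EPYC 7742 2P, and none for the EPYC 7601 2P. A geometric mean is used for the ratio because the test times span several orders of magnitude, so an arithmetic mean would be dominated by the longest-running tests.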

MKL-DNN DNNL

This is a test of Intel MKL-DNN (DNNL / Deep Neural Network Library), an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported by benchdnn, in milliseconds. Learn more via the OpenBenchmarking.org test page.

MKL-DNN DNNL 1.1 - Harness: IP Batch 1D - Data Type: f32 (ms, fewer is better)
  EPYC 7742 2P: 2.11 (SE +/- 0.00, N = 3; MIN: 1.88; Min: 2.1 / Avg: 2.11 / Max: 2.11)
  Xeon Platinum 8280 2P: 1.42 (SE +/- 0.01, N = 3; MIN: 1.32; Min: 1.41 / Avg: 1.42 / Max: 1.43)
  EPYC 7601 2P: 4.90 (SE +/- 0.03, N = 3; MIN: 4.24; Min: 4.84 / Avg: 4.9 / Max: 4.95)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL 1.1 - Harness: IP Batch All - Data Type: f32 (ms, fewer is better)
  EPYC 7742 2P: 11.10 (SE +/- 0.01, N = 3; MIN: 10.49; Min: 11.08 / Avg: 11.1 / Max: 11.12)
  Xeon Platinum 8280 2P: 9.83 (SE +/- 0.03, N = 3; MIN: 9.47; Min: 9.81 / Avg: 9.83 / Max: 9.89)
  EPYC 7601 2P: 47.83 (SE +/- 0.07, N = 3; MIN: 46.02; Min: 47.73 / Avg: 47.83 / Max: 47.96)
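
The "SE +/-" figures can be related to the run-to-run spread. The sketch below assumes that SE is the sample standard deviation divided by the square root of N; the result file does not state this formula, so treat it as an assumption. The individual run times are not listed either, so for an N = 3 result the middle run is reconstructed from the rounded Min, Avg, and Max values and is only approximate. The helper names `standard_error` and `reconstruct_three_runs` are hypothetical.

```python
from math import sqrt
from statistics import stdev


def standard_error(samples):
    """Assumed definition: sample standard deviation divided by sqrt(N)."""
    return stdev(samples) / sqrt(len(samples))


def reconstruct_three_runs(minimum, average, maximum):
    """For N = 3, the three runs sum to 3 * average, so the middle run
    is whatever remains after subtracting the minimum and the maximum."""
    middle = 3 * average - minimum - maximum
    return [minimum, middle, maximum]


# EPYC 7601 2P, IP Batch All - f32: Min 47.73 / Avg 47.83 / Max 47.96
runs = reconstruct_three_runs(47.73, 47.83, 47.96)
print(f"{standard_error(runs):.2f}")  # prints 0.07
```

The printed value agrees with the "SE +/- 0.07, N = 3" reported for that result, which is consistent with the assumed formula but does not prove it.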

MKL-DNN DNNL 1.1 - Harness: IP Batch 1D - Data Type: u8s8f32 (ms, fewer is better)
  EPYC 7742 2P: 29.21 (SE +/- 0.13, N = 3; MIN: 26.87; Min: 29.07 / Avg: 29.21 / Max: 29.47)
  Xeon Platinum 8280 2P: 2.18 (SE +/- 0.01, N = 3; Min: 2.17 / Avg: 2.18 / Max: 2.21)
  EPYC 7601 2P: 40.53 (SE +/- 0.03, N = 3; MIN: 39.01; Min: 40.47 / Avg: 40.53 / Max: 40.56)

MKL-DNN DNNL 1.1 - Harness: IP Batch All - Data Type: u8s8f32 (ms, fewer is better)
  EPYC 7742 2P: 176.32 (SE +/- 0.47, N = 3; MIN: 170.32; Min: 175.63 / Avg: 176.32 / Max: 177.21)
  Xeon Platinum 8280 2P: 2.90 (SE +/- 0.03, N = 3; Min: 2.84 / Avg: 2.9 / Max: 2.94)
  EPYC 7601 2P: 270.54 (SE +/- 0.50, N = 3; MIN: 263.9; Min: 269.56 / Avg: 270.54 / Max: 271.2)

MKL-DNN DNNL 1.1 - Harness: Convolution Batch conv_3d - Data Type: f32 (ms, fewer is better)
  EPYC 7742 2P: 4.32 (SE +/- 0.02, N = 3; MIN: 3.54; Min: 4.29 / Avg: 4.32 / Max: 4.35)
  Xeon Platinum 8280 2P: 4.70 (SE +/- 0.03, N = 3; MIN: 4.44; Min: 4.65 / Avg: 4.7 / Max: 4.75)
  EPYC 7601 2P: 17.27 (SE +/- 0.75, N = 15; MIN: 10.57; Min: 11.26 / Avg: 17.27 / Max: 22.73)

MKL-DNN DNNL 1.1 - Harness: Convolution Batch conv_all - Data Type: f32 (ms, fewer is better)
  EPYC 7742 2P: 403.50 (SE +/- 4.94, N = 3; MIN: 372.26; Min: 396.31 / Avg: 403.5 / Max: 412.97)
  Xeon Platinum 8280 2P: 494.94 (SE +/- 0.36, N = 3; MIN: 488.77; Min: 494.38 / Avg: 494.94 / Max: 495.62)
  EPYC 7601 2P: 2040.28 (SE +/- 10.16, N = 3; MIN: 1969.21; Min: 2020.17 / Avg: 2040.28 / Max: 2052.91)

MKL-DNN DNNL 1.1 - Harness: Convolution Batch conv_3d - Data Type: u8s8f32 (ms, fewer is better)
  EPYC 7742 2P: 1391.48 (SE +/- 9.19, N = 3; MIN: 1356.81; Min: 1378.67 / Avg: 1391.48 / Max: 1409.3)
  Xeon Platinum 8280 2P: 2989.04 (SE +/- 3.75, N = 3; MIN: 2929.55; Min: 2982.56 / Avg: 2989.04 / Max: 2995.54)
  EPYC 7601 2P: 2386.59 (SE +/- 15.68, N = 3; MIN: 2340.46; Min: 2359.06 / Avg: 2386.59 / Max: 2413.36)

MKL-DNN DNNL 1.1 - Harness: Deconvolution Batch deconv_1d - Data Type: f32 (ms, fewer is better)
  EPYC 7742 2P: 2.89 (SE +/- 0.01, N = 3; MIN: 2.61; Min: 2.86 / Avg: 2.89 / Max: 2.9)
  Xeon Platinum 8280 2P: 1.24 (SE +/- 0.01, N = 3; MIN: 1.17; Min: 1.24 / Avg: 1.24 / Max: 1.25)
  EPYC 7601 2P: 4.72 (SE +/- 0.10, N = 15; MIN: 3.86; Min: 4.29 / Avg: 4.72 / Max: 5.16)

MKL-DNN DNNL 1.1 - Harness: Deconvolution Batch deconv_3d - Data Type: f32 (ms, fewer is better)
  EPYC 7742 2P: 3.25 (SE +/- 0.08, N = 15; MIN: 2.46; Min: 2.66 / Avg: 3.25 / Max: 3.59)
  Xeon Platinum 8280 2P: 1.15 (SE +/- 0.00, N = 4; Min: 1.15 / Avg: 1.15 / Max: 1.16)
  EPYC 7601 2P: 9.37 (SE +/- 0.06, N = 3; MIN: 9.06; Min: 9.28 / Avg: 9.37 / Max: 9.49)

MKL-DNN DNNL 1.1 - Harness: Convolution Batch conv_alexnet - Data Type: f32 (ms, fewer is better)
  EPYC 7742 2P: 47.43 (SE +/- 0.40, N = 15; MIN: 42.56; Min: 45.21 / Avg: 47.43 / Max: 49.24)
  Xeon Platinum 8280 2P: 49.79 (SE +/- 0.28, N = 3; MIN: 48.58; Min: 49.34 / Avg: 49.79 / Max: 50.3)
  EPYC 7601 2P: 192.49 (SE +/- 0.33, N = 3; MIN: 183.91; Min: 191.85 / Avg: 192.49 / Max: 192.93)

MKL-DNN DNNL 1.1 - Harness: Convolution Batch conv_all - Data Type: u8s8f32 (ms, fewer is better)
  EPYC 7742 2P: 9100.13 (SE +/- 13.71, N = 3; MIN: 8785.76; Min: 9073.96 / Avg: 9100.13 / Max: 9120.31)
  Xeon Platinum 8280 2P: 1521.72 (SE +/- 1.30, N = 3; Min: 1519.7 / Avg: 1521.72 / Max: 1524.14)
  EPYC 7601 2P: 16846.88 (SE +/- 198.89, N = 5; MIN: 15888.1; Min: 16199.6 / Avg: 16846.88 / Max: 17302.5)

MKL-DNN DNNL 1.1 - Harness: Deconvolution Batch deconv_all - Data Type: f32 (ms, fewer is better)
  EPYC 7742 2P: 837.06 (SE +/- 10.57, N = 3; MIN: 769.65; Min: 815.93 / Avg: 837.06 / Max: 847.82)
  Xeon Platinum 8280 2P: 877.84 (SE +/- 0.45, N = 3; MIN: 871.06; Min: 877.07 / Avg: 877.84 / Max: 878.62)
  EPYC 7601 2P: 3470.77 (SE +/- 101.40, N = 9; MIN: 3032.19; Min: 3234.24 / Avg: 3470.77 / Max: 4249.25)

MKL-DNN DNNL 1.1 - Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 (ms, fewer is better)
  EPYC 7742 2P: 517.95 (SE +/- 1.54, N = 3; MIN: 505.29; Min: 515.41 / Avg: 517.95 / Max: 520.73)
  Xeon Platinum 8280 2P: 0.44 (SE +/- 0.00, N = 3; Min: 0.43 / Avg: 0.44 / Max: 0.44)
  EPYC 7601 2P: 900.06 (SE +/- 1.72, N = 3; MIN: 888.12; Min: 897.01 / Avg: 900.06 / Max: 902.94)

MKL-DNN DNNL 1.1 - Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 (ms, fewer is better)
  EPYC 7742 2P: 846.19 (SE +/- 0.69, N = 3; MIN: 840.22; Min: 845.48 / Avg: 846.19 / Max: 847.57)
  Xeon Platinum 8280 2P: 1881.87 (SE +/- 0.80, N = 3; MIN: 1867.97; Min: 1880.32 / Avg: 1881.87 / Max: 1883.02)
  EPYC 7601 2P: 1526.11 (SE +/- 1.78, N = 3; MIN: 1505.07; Min: 1522.7 / Avg: 1526.11 / Max: 1528.67)

MKL-DNN DNNL 1.1 - Harness: Recurrent Neural Network Training - Data Type: f32 (ms, fewer is better)
  EPYC 7742 2P: 814.70 (SE +/- 6.96, N = 12; MIN: 752.4; Min: 787.93 / Avg: 814.7 / Max: 864.38)
  Xeon Platinum 8280 2P: 223.03 (SE +/- 2.99, N = 4; MIN: 207.46; Min: 214.91 / Avg: 223.03 / Max: 229.36)
  EPYC 7601 2P: 709.12 (SE +/- 6.47, N = 15; MIN: 637.59; Min: 683.43 / Avg: 709.12 / Max: 754.88)

MKL-DNN DNNL 1.1 - Harness: Convolution Batch conv_alexnet - Data Type: u8s8f32 (ms, fewer is better)
  EPYC 7742 2P: 540.52 (SE +/- 0.96, N = 3; MIN: 525.23; Min: 538.8 / Avg: 540.52 / Max: 542.11)
  Xeon Platinum 8280 2P: 19.51 (SE +/- 0.17, N = 3; Min: 19.17 / Avg: 19.51 / Max: 19.71)
  EPYC 7601 2P: 1179.54 (SE +/- 3.94, N = 3; MIN: 1154.96; Min: 1174.05 / Avg: 1179.54 / Max: 1187.19)

MKL-DNN DNNL 1.1 - Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32 (ms, fewer is better)
  EPYC 7742 2P: 29.38 (SE +/- 0.26, N = 3; MIN: 26.98; Min: 28.86 / Avg: 29.38 / Max: 29.66)
  Xeon Platinum 8280 2P: 23.08 (SE +/- 0.10, N = 3; MIN: 21.92; Min: 22.91 / Avg: 23.08 / Max: 23.25)
  EPYC 7601 2P: 116.50 (SE +/- 0.24, N = 3; MIN: 101.76; Min: 116.22 / Avg: 116.5 / Max: 116.99)

MKL-DNN DNNL 1.1 - Harness: Convolution Batch conv_googlenet_v3 - Data Type: u8s8f32 (ms, fewer is better)
  EPYC 7742 2P: 1580.49 (SE +/- 1.21, N = 3; MIN: 1487.89; Min: 1578.56 / Avg: 1580.49 / Max: 1582.73)
  Xeon Platinum 8280 2P: 7.68 (SE +/- 0.03, N = 3; Min: 7.62 / Avg: 7.68 / Max: 7.73)
  EPYC 7601 2P: 1673.36 (SE +/- 51.26, N = 9; MIN: 1293.52; Min: 1328.9 / Avg: 1673.36 / Max: 1874.83)