oneDNN

This test benchmarks Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark onednn.
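For scripted use, the basic command above can be wrapped in a few lines of Python. This is only a minimal sketch: the helper names build_command and run_benchmark are hypothetical, and the only thing taken from the text is the command phoronix-test-suite benchmark onednn itself. The benchmark command may prompt for options interactively, so the wrapper simply hands control to the process.

```python
import shutil
import subprocess


def build_command(test_profile="onednn"):
    """Return the argv list for the basic benchmark command from the text."""
    return ["phoronix-test-suite", "benchmark", test_profile]


def run_benchmark(test_profile="onednn"):
    """Run the benchmark in the foreground and return its exit code.

    Hypothetical helper: it only checks that the executable is on PATH
    and then launches the command unchanged.
    """
    if shutil.which("phoronix-test-suite") is None:
        raise FileNotFoundError("phoronix-test-suite is not on PATH")
    return subprocess.run(build_command(test_profile)).returncode
```

Calling run_benchmark() from a script is then equivalent to typing the command in a shell.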

Project Site

github.com

Test Created

17 June 2020

Last Updated

20 December 2020

Test Maintainer

Michael Larabel

Test Type

Processor

Average Install Time

8 Minutes, 38 Seconds

Average Run Time

2 Minutes, 2 Seconds

Test Dependencies

C/C++ Compiler Toolchain + CMake

Accolades

5k+ Downloads

Supported Platforms


[Chart: oneDNN Popularity Statistics for pts/onednn. Events per month, 2020.06 through 2021.01: Public Result Uploads, Reported Installs*, Test Completions*.]
* Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
Data current as of Tue, 26 Jan 2021 07:06:13 GMT.
Harness Option Popularity
Recurrent Neural Network Training: 15.9%
Recurrent Neural Network Inference: 15.6%
IP Shapes 1D: 11.5%
IP Shapes 3D: 11.5%
Convolution Batch Shapes Auto: 11.5%
Deconvolution Batch shapes_1d: 11.4%
Deconvolution Batch shapes_3d: 11.4%
Matrix Multiply Batch Shapes Transformer: 11.3%

Data Type Option Popularity
f32: 46.3%
u8s8f32: 39.8%
bf16bf16bf16: 13.9%

Revision History

pts/onednn-1.6.1   Sun, 20 Dec 2020 09:58:16 GMT
This test profile builds and works fine on macOS so enable it (MacOSX).

pts/onednn-1.6.0   Wed, 09 Dec 2020 13:47:31 GMT
Update against oneDNN 2.0 upstream.

pts/onednn-1.5.0   Wed, 17 Jun 2020 16:26:39 GMT
Initial commit of the oneDNN test profile, based on Intel oneDNN 1.5. It is forked from the existing mkl-dnn test profile, which took its name from MKL-DNN before the library was renamed to DNNL and then oneDNN. The new test profile was created to match Intel's naming convention.

Suites Using This Test

Multi-Core

Machine Learning

Intel oneAPI

CPU Massive

Server CPU Tests

Creator Workloads

HPC - High Performance Computing


Performance Metrics

Analyze Test Configuration:

oneDNN 2.0

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.org metrics for this test profile configuration based on 406 public results since 9 December 2020 with the latest data as of 26 January 2021.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. Keep in mind that, particularly in the Linux/open-source space, OS configurations can differ vastly, so this overview is intended only as general guidance on performance expectations.

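| Component | Percentile Rank | # Matching Public Results | ms (Average) |
|  | 97th | 4 | 1434 +/- 7 |
|  | 95th | 7 | 1483 +/- 20 |
|  | 94th | 3 | 1560 +/- 4 |
|  | 93rd | 3 | 1612 +/- 1 |
|  | 92nd | 6 | 1646 +/- 2 |
|  | 90th | 3 | 2097 +/- 1 |
|  | 90th | 4 | 2187 +/- 126 |
|  | 86th | 12 | 2731 +/- 245 |
|  | 84th | 3 | 2858 +/- 26 |
|  | 83rd | 6 | 2988 +/- 233 |
|  | 81st | 4 | 3042 +/- 46 |
|  | 80th | 10 | 3242 +/- 465 |
|  | 80th | 3 | 3407 +/- 3 |
|  | 78th | 8 | 3487 +/- 81 |
|  | 77th | 3 | 3608 +/- 8 |
| Mid-Tier | 75th |  | > 3674 |
|  | 70th | 6 | 3786 +/- 51 |
|  | 70th | 10 | 3815 +/- 138 |
|  | 70th | 4 | 3837 +/- 31 |
|  | 68th | 6 | 3894 +/- 36 |
|  | 66th | 6 | 3960 +/- 523 |
|  | 64th | 8 | 4000 +/- 2 |
|  | 62nd | 3 | 4023 +/- 3 |
|  | 62nd | 3 | 4038 +/- 19 |
|  | 60th | 3 | 4187 +/- 48 |
|  | 58th | 3 | 4478 +/- 29 |
|  | 57th | 3 | 4530 +/- 4 |
|  | 56th | 9 | 4696 +/- 230 |
|  | 55th | 3 | 4763 +/- 3 |
|  | 54th | 4 | 4808 +/- 11 |
|  | 51st | 3 | 5961 +/- 35 |
|  | 51st | 3 | 6058 +/- 14 |
| Median | 50th |  | 6074 |
|  | 50th | 3 | 6241 +/- 22 |
|  | 49th | 4 | 6374 +/- 26 |
|  | 47th | 4 | 6702 +/- 25 |
|  | 47th | 3 | 6761 +/- 1 |
|  | 46th | 3 | 6772 +/- 3 |
|  | 44th | 6 | 6867 +/- 32 |
|  | 44th | 3 | 6877 +/- 13 |
|  | 43rd | 3 | 7161 +/- 7 |
|  | 41st | 3 | 7394 +/- 13 |
|  | 41st | 3 | 7400 +/- 4 |
|  | 39th | 3 | 7484 +/- 9 |
|  | 38th | 3 | 7717 +/- 74 |
|  | 37th | 3 | 7743 +/- 2 |
|  | 36th | 3 | 7909 +/- 11 |
|  | 34th | 4 | 8249 +/- 141 |
|  | 34th | 3 | 8352 +/- 80 |
|  | 33rd | 3 | 8451 +/- 43 |
|  | 30th | 7 | 8915 +/- 111 |
|  | 29th | 3 | 8935 +/- 26 |
|  | 26th | 9 | 9304 +/- 348 |
| Low-Tier | 25th |  | > 9350 |
|  | 25th | 3 | 9569 +/- 30 |
|  | 25th | 6 | 9585 +/- 399 |
|  | 23rd | 3 | 10525 +/- 140 |
|  | 23rd | 3 | 10598 +/- 246 |
|  | 21st | 6 | 11081 +/- 25 |
|  | 20th | 3 | 12586 +/- 31 |
|  | 19th | 3 | 12673 +/- 557 |
|  | 19th | 4 | 12791 +/- 156 |
|  | 18th | 3 | 12863 +/- 7 |
|  | 16th | 7 | 13094 +/- 60 |
|  | 16th | 3 | 13137 +/- 231 |
|  | 15th | 3 | 13181 +/- 226 |
|  | 13th | 3 | 13566 +/- 25 |
|  | 11th | 3 | 14756 +/- 297 |
|  | 9th | 3 | 20832 +/- 20 |
|  | 9th | 3 | 21280 +/- 32 |
|  | 8th | 3 | 22097 +/- 177 |
|  | 7th | 7 | 31372 +/- 674 |
|  | 6th | 3 | 31708 +/- 36 |
|  | 4th | 4 | 44764 +/- 28 |
|  | 3rd | 3 | 66778 +/- 8 |
|  | 2nd | 4 | 68457 +/- 393 |
|  | 2nd | 3 | 73528 +/- 27 |
|  | 1st | 3 | 77508 +/- 465 |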
[Chart: Distribution Of Public Results - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU. 406 Results Range From 985 To 77853 ms.]
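Because this metric is reported in ms and fewer is better, a higher percentile rank corresponds to a lower time. The sketch below shows one plausible way to place a result within a set of public results. It is an assumption for illustration, not OpenBenchmarking.org's actual formula; the function name percentile_rank and the sample list (a handful of values borrowed from the overview above) are hypothetical.

```python
from bisect import bisect_right


def percentile_rank(result_ms, public_results_ms):
    """Percent of public results that are slower (larger ms) than result_ms.

    Assumed definition: for a fewer-is-better metric, a result's rank is
    the share of the population it beats.
    """
    ordered = sorted(public_results_ms)
    # bisect_right counts entries <= result_ms; the remainder are slower.
    slower = len(ordered) - bisect_right(ordered, result_ms)
    return 100.0 * slower / len(ordered)


# Illustrative sample only, not the full set of 406 public results.
sample = [1434, 1646, 3674, 6074, 9350, 13566, 31372, 77508]
print(percentile_rank(6074, sample))
```

With this definition, a result faster than every entry in the sample ranks at 100 and one slower than every entry ranks at 0.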

Based on OpenBenchmarking.org data, the selected test / test configuration (oneDNN 2.0 - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU) has an average run-time of 7 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.
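The run-count behaviour described above can be sketched as a simple loop: take the minimum number of runs, then keep adding runs while the relative standard deviation stays above a limit. The threshold of 3.5% and the cap of 15 runs below are placeholder assumptions, not the Phoronix Test Suite's actual defaults, and the function names are hypothetical.

```python
import statistics


def relative_std_dev(samples):
    """Sample standard deviation expressed as a percentage of the mean."""
    return 100.0 * statistics.stdev(samples) / statistics.mean(samples)


def run_until_stable(run_once, min_runs=3, max_runs=15, max_rsd_percent=3.5):
    """Call run_once() at least min_runs times, adding runs while noisy.

    max_runs and max_rsd_percent are illustrative values only.
    """
    times = [run_once() for _ in range(min_runs)]
    while len(times) < max_runs and relative_std_dev(times) > max_rsd_percent:
        times.append(run_once())
    return times


# Usage with a stand-in for a real benchmark run (values in ms).
fake_runs = iter([7160.0, 7155.0, 7168.0])
print(len(run_until_stable(lambda: next(fake_runs))))
```

In the usage example the three stand-in timings agree closely, so no extra runs are requested.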

[Chart: Time Required To Complete Benchmark, in minutes - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU. Run-Time: Min: 2 / Avg: 7.17 / Max: 28.]

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 0.8%.

[Chart: Average Deviation Between Runs, in percent, fewer is better - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU. Deviation: Min: 0 / Avg: 0.82 / Max: 11.]

Does It Scale Well With Increasing Cores?

Yes. Based on the automated analysis of the collected public benchmark data, this test configuration generally scales well with increasing CPU core counts. The analysis uses publicly available results for this test configuration: results are separated by vendor, divided by the reference CPU clock speed, grouped by matching physical CPU core count, and normalized against the smallest core count tested from each vendor, for each CPU having a sufficient number of test samples and statistically significant data.

[Chart: oneDNN CPU Core Scaling - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU. Relative Core Scaling To Base by physical core count, for AMD and Intel.]
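The normalization steps described for the core-scaling analysis can be sketched roughly as follows. Several details are assumptions made for illustration: the ms result is inverted into a rate so that higher means faster, results in each core-count group are combined with a plain mean, and the function name core_scaling and the sample tuples are hypothetical rather than taken from OpenBenchmarking.org.

```python
from collections import defaultdict
from statistics import mean


def core_scaling(samples):
    """Return {vendor: {physical_cores: scaling relative to smallest count}}.

    samples: iterable of (vendor, physical_cores, clock_ghz, result_ms).
    """
    grouped = defaultdict(lambda: defaultdict(list))
    for vendor, cores, clock_ghz, result_ms in samples:
        # Assumption: invert ms into a rate, then divide by clock speed
        # to get a per-GHz score that is comparable across CPUs.
        grouped[vendor][cores].append((1.0 / result_ms) / clock_ghz)

    scaling = {}
    for vendor, by_cores in grouped.items():
        averaged = {cores: mean(scores) for cores, scores in by_cores.items()}
        base = averaged[min(averaged)]  # smallest core count for this vendor
        scaling[vendor] = {c: averaged[c] / base for c in sorted(averaged)}
    return scaling


# Made-up data points for illustration only.
demo = [
    ("AMD", 4, 4.0, 8000.0),
    ("AMD", 8, 4.0, 4000.0),
    ("Intel", 4, 2.0, 10000.0),
    ("Intel", 16, 2.0, 2500.0),
]
print(core_scaling(demo))
```

Each vendor's smallest core count maps to 1.0, and larger counts show how much faster the clock-normalized result is relative to that base.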

Recent Test Results


3 Systems - 376 Benchmark Results

2 x AMD EPYC 7F72 24-Core - Supermicro H11DSi-NT v2.00 - AMD Starship

Ubuntu 20.10 - 5.11.0-051100rc4daily20210122-generic - GNOME Shell 3.38.1

2 Systems - 220 Benchmark Results

AMD Ryzen 9 5950X 16-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship

Ubuntu 20.10 - 5.11.0-rc4-max-boost-inv-patch - GNOME Shell 3.38.1

3 Systems - 376 Benchmark Results

2 x AMD EPYC 7F72 24-Core - Supermicro H11DSi-NT v2.00 - AMD Starship

Ubuntu 20.10 - 5.11.0-rc4-max-boost-inv-patch - GNOME Shell 3.38.1

1 System - 466 Benchmark Results

2 x AMD EPYC 7F72 24-Core - Supermicro H11DSi-NT v2.00 - AMD Starship

Ubuntu 20.10 - 5.10.9-051009-generic - GNOME Shell 3.38.1

3 Systems - 74 Benchmark Results

2 x AMD EPYC 7601 32-Core - Dell 02MJ3T - AMD 17h

Ubuntu 19.10 - 5.9.0-050900rc6daily20200922-generic - GNOME Shell 3.34.1

2 Systems - 129 Benchmark Results

AMD Ryzen 9 5950X 16-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship

Ubuntu 20.10 - 5.11.0-051100rc2daily20210108-generic - GNOME Shell 3.38.1

3 Systems - 191 Benchmark Results

AMD Ryzen 3 2200G - ASUS PRIME B350M-E - AMD Raven

Ubuntu 20.10 - 5.8.0-38-generic - GNOME Shell 3.38.1

4 Systems - 104 Benchmark Results

AMD Ryzen 5 2400G - MSI B350M GAMING PRO - AMD Raven

Ubuntu 19.10 - 5.3.0-64-generic - GNOME Shell 3.34.1

4 Systems - 61 Benchmark Results

Intel Core i9-7960X - MSI X299 SLI PLUS - Intel Sky Lake-E DMI3 Registers

Ubuntu 20.04 - 5.4.0-58-generic - X Server 1.20.8

3 Systems - 113 Benchmark Results

Intel Core i9-7980XE - ASUS PRIME X299-A - Intel Sky Lake-E DMI3 Registers

Ubuntu 20.10 - 5.8.0-36-generic - GNOME Shell 3.38.1

Most Popular Test Results


3 Systems - 74 Benchmark Results

2 x AMD EPYC 7601 32-Core - Dell 02MJ3T - AMD 17h

Ubuntu 19.10 - 5.9.0-050900rc6daily20200922-generic - GNOME Shell 3.34.1

3 Systems - 191 Benchmark Results

AMD Ryzen 3 2200G - ASUS PRIME B350M-E - AMD Raven

Ubuntu 20.10 - 5.8.0-38-generic - GNOME Shell 3.38.1

3 Systems - 19 Benchmark Results

AMD Ryzen Threadripper 3990X 64-Core - System76 Thelio Major - AMD Starship

Pop 20.10 - 5.8.0-7625-generic - GNOME Shell 3.38.1

3 Systems - 376 Benchmark Results

2 x AMD EPYC 7F72 24-Core - Supermicro H11DSi-NT v2.00 - AMD Starship

Ubuntu 20.10 - 5.11.0-051100rc4daily20210122-generic - GNOME Shell 3.38.1

3 Systems - 23 Benchmark Results

AMD Ryzen Threadripper 3970X 32-Core - ASUS ROG ZENITH II EXTREME - AMD Starship

Ubuntu 20.10 - 5.8.0-29-generic - GNOME Shell 3.38.1

4 Systems - 104 Benchmark Results

AMD Ryzen 5 2400G - MSI B350M GAMING PRO - AMD Raven

Ubuntu 19.10 - 5.3.0-64-generic - GNOME Shell 3.34.1

2 Systems - 220 Benchmark Results

AMD Ryzen 9 5950X 16-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship

Ubuntu 20.10 - 5.11.0-rc4-max-boost-inv-patch - GNOME Shell 3.38.1

4 Systems - 18 Benchmark Results

AMD Ryzen 7 5800X 8-Core - ASRock X570 Pro4 - AMD Starship

Ubuntu 20.10 - 5.8.0-31-generic - GNOME Shell 3.38.1

4 Systems - 35 Benchmark Results

Apple M1 - Apple Mac mini - 8GB

macOS 11.1 - 20.2.0 - OpenCL 1.2

3 Systems - 148 Benchmark Results

Intel Core i7-4790K - Gigabyte Z97-HD3P - Intel 4th Gen Core DRAM

Ubuntu 19.10 - 5.9.0-050900rc8daily20201009-generic - GNOME Shell 3.34.1
