ACES DGEMM

This is a multi-threaded DGEMM benchmark.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark mt-dgemm.

Project Site

lanl.gov

Test Created

11 October 2019

Test Maintainer

Michael Larabel 

Test Type

Processor

Average Install Time

2 Seconds

Average Run Time

15 Minutes, 47 Seconds

Test Dependencies

C/C++ Compiler Toolchain

Accolades

5k+ Downloads

Supported Platforms


Public Result UploadsReported Installs*Test Completions*OpenBenchmarking.orgEventsACES DGEMM Popularity Statisticspts/mt-dgemm2019.102019.112019.122020.012020.022020.032020.042020.052020.062020.072020.082020.092020.102020.112020.122021.012021.022021.032021.042021.0530060090012001500
* Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
Data current as of Tue, 11 May 2021 09:51:25 GMT.

Revision History

pts/mt-dgemm-1.2.0   [View Source]   Fri, 11 Oct 2019 15:29:15 GMT
Initial commit of ACES DGEMM

Suites Using This Test

Multi-Core

Scientific Computing

HPC - High Performance Computing

Linear Algebra

Programmer / Developer System Benchmarks


Performance Metrics

Analyze Test Configuration:

ACES DGEMM 1.0

Sustained Floating-Point Rate

OpenBenchmarking.org metrics for this test profile configuration based on 1,281 public results since 11 October 2019 with the latest data as of 10 May 2021.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component
Percentile Rank
# Matching Public Results
GFLOP/s (Average)
100th
12
38.4 +/- 0.4
100th
7
32.0 +/- 0.3
98th
27
27.4 +/- 3.2
98th
6
23.2 +/- 0.4
97th
10
21.5 +/- 0.1
95th
5
20.5 +/- 0.3
93rd
11
19.8 +/- 0.7
92nd
7
19.7 +/- 1.0
91st
5
18.8 +/- 0.1
90th
6
17.5 +/- 1.0
89th
12
16.7 +/- 0.2
88th
8
16.3 +/- 0.3
86th
12
15.5 +/- 0.3
86th
3
15.1 +/- 0.4
84th
33
13.9 +/- 0.2
82nd
7
13.6 +/- 0.4
82nd
7
13.4 +/- 0.4
82nd
4
13.3 +/- 1.6
81st
11
12.7 +/- 0.1
80th
6
12.6 +/- 0.5
79th
10
12.2 +/- 0.1
79th
6
12.1 +/- 0.6
78th
6
11.7 +/- 0.8
77th
9
11.3 +/- 0.3
77th
6
11.3 +/- 0.4
76th
4
11.2 +/- 0.2
Mid-Tier
75th
< 11.1
75th
5
10.7 +/- 0.4
74th
9
10.6 +/- 1.1
74th
6
10.2 +/- 0.6
72nd
11
9.8 +/- 0.1
70th
8
9.4 +/- 0.4
68th
14
9.2 +/- 0.3
67th
9
9.1 +/- 0.2
64th
7
8.9 +/- 0.1
64th
42
8.8 +/- 0.8
63rd
6
8.3 +/- 0.2
62nd
3
7.9 +/- 0.1
61st
8
7.6 +/- 0.2
58th
15
7.2 +/- 0.1
57th
3
7.2 +/- 0.2
56th
9
6.8 +/- 0.3
55th
3
6.6 +/- 0.2
55th
3
6.5 +/- 0.2
53rd
24
6.3 +/- 0.2
52nd
4
6.2 +/- 0.2
51st
6
6.0 +/- 0.2
51st
11
5.9 +/- 0.5
Median
50th
5.9
49th
6
5.7 +/- 0.4
48th
15
5.5 +/- 0.1
46th
13
5.2 +/- 0.2
45th
5
5.1 +/- 0.3
44th
3
5.0 +/- 0.1
43rd
13
4.9 +/- 0.3
43rd
16
4.7 +/- 0.5
40th
6
4.4 +/- 0.3
40th
6
4.4 +/- 0.1
38th
5
4.2 +/- 0.2
36th
29
4.1 +/- 0.5
34th
10
3.8 +/- 0.3
34th
5
3.6 +/- 0.4
32nd
9
3.5 +/- 0.1
31st
6
3.3 +/- 0.1
Low-Tier
25th
< 2.6
25th
10
2.6 +/- 0.2
25th
4
2.5 +/- 0.1
23rd
10
2.3 +/- 0.1
23rd
9
2.3 +/- 0.3
23rd
6
2.3 +/- 0.1
22nd
5
2.1 +/- 0.1
20th
4
1.8 +/- 0.2
19th
3
1.7 +/- 0.1
18th
18
1.6 +/- 0.1
15th
7
1.3 +/- 0.2
OpenBenchmarking.orgDistribution Of Public Results - Sustained Floating-Point Rate1281 Results Range From 0 To 40 GFLOP/s0481216202428323640444890180270360450

Based on OpenBenchmarking.org data, the selected test / test configuration (ACES DGEMM 1.0 - Sustained Floating-Point Rate) has an average run-time of 7 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

OpenBenchmarking.orgMinutesTime Required To Complete BenchmarkSustained Floating-Point RateRun-Time20406080100Min: 1 / Avg: 6.4 / Max: 93

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 1.3%.

OpenBenchmarking.orgPercent, Fewer Is BetterAverage Deviation Between RunsSustained Floating-Point RateDeviation3691215Min: 0 / Avg: 1.35 / Max: 8

Does It Scale Well With Increasing Cores?

Yes, based on the automated analysis of the collected public benchmark data, this test / test settings does generally scale well with increasing CPU core counts. Data based on publicly available results for this test / test settings, separated by vendor, result divided by the reference CPU clock speed, grouped by matching physical CPU core count, and normalized against the smallest core count tested from each vendor for each CPU having a sufficient number of test samples and statistically significant data.

IntelAMDOpenBenchmarking.orgRelative Core Scaling To BaseACES DGEMM CPU Core ScalingSustained Floating-Point Rate4681216182024324864128612182430

Notable Instruction Set Usage

Notable instruction set extensions supported by this test, based on an automatic analysis by the Phoronix Test Suite / OpenBenchmarking.org analytics engine.

Instruction Set
Support
Instructions Detected
Used by default on supported hardware.
Found on Intel processors since Sandy Bridge (2011).
Found on AMD processors since Bulldozer (2011).

 
VZEROUPPER VEXTRACTF128 VINSERTF128
FMA (FMA)
Used by default on supported hardware.
Found on Intel processors since Haswell (2013).
Found on AMD processors since Bulldozer (2011).

 
VFMADD132SD VFMADD231SD
The test / benchmark does honor compiler flag changes.
Last automated analysis: 10 May 2021

This test profile binary relies on the shared libraries libgomp.so.1, libc.so.6, libdl.so.2, libpthread.so.0.

Recent Test Results

OpenBenchmarking.org Results Compare

3 Systems - 19 Benchmark Results

Intel Core i7-9750H - Dell 0CRKJ6 - Intel Cannon Lake PCH

Fedora 34 - 5.11.18-300.fc34.x86_64 - GNOME Shell 40.0

1 System - 3 Benchmark Results

Intel Core i7-9750H - Dell 0CRKJ6 - Intel Cannon Lake PCH

Fedora 34 - 5.12.2-xanmod1.0.fc34 - GNOME Shell 40.0

1 System - 19 Benchmark Results

Intel Core i7-9750H - Dell 0CRKJ6 - Intel Cannon Lake PCH

Fedora 34 - 5.11.18-300.fc34.x86_64 - GNOME Shell 40.0

1 System - 60 Benchmark Results

AMD Ryzen 9 5900X 12-Core - Gigabyte X570 AORUS MASTER - AMD Starship

Red Hat Enterprise Linux 8.3 - 4.18.0-240.22.1.el8_3.x86_64 - GNOME Shell 3.32.2

4 Systems - 14 Benchmark Results

Intel Core i5-5200U - ASUS X555LB v1.0 - Intel Broadwell-U-OPI

Fedora 34 - 5.11.17-300.fc34.x86_64 - GNOME Shell 40.0

3 Systems - 14 Benchmark Results

Intel Core i5-5200U - ASUS X555LB v1.0 - Intel Broadwell-U-OPI

Clear Linux OS 34560 - 5.10.33-1036.native - GNOME Shell 40.0

2 Systems - 14 Benchmark Results

Intel Core i5-5200U - ASUS X555LB v1.0 - Intel Broadwell-U-OPI

Clear Linux OS 34560 - 5.10.33-1036.native - GNOME Shell 40.0

1 System - 14 Benchmark Results

Intel Core i5-5200U - ASUS X555LB v1.0 - Intel Broadwell-U-OPI

Fedora 34 - 5.11.17-300.fc34.x86_64 - GNOME Shell 40.0

2 Systems - 169 Benchmark Results

AMD Ryzen 3 3200G - ASRock B450M-HDV R4.0 - AMD Raven

Clear Linux OS 34550 - 5.10.19-1032.native - GNOME Shell 40.0

Most Popular Test Results

OpenBenchmarking.org Results Compare

3 Systems - 268 Benchmark Results

Intel Core i5-2520M - HP 161C - Intel 2nd Generation Core DRAM

Ubuntu 18.04 - 4.18.0-20-generic - GNOME Shell 3.28.3

12 Systems - 593 Benchmark Results

AMD Ryzen 7 3800XT 8-Core - MSI MEG X570 GODLIKE - AMD Starship

Ubuntu 20.04 - 5.8.0-050800daily20200622-generic - GNOME Shell 3.36.2

11 Systems - 217 Benchmark Results

Intel Core i9-10900K - Gigabyte Z490 AORUS MASTER - Intel Comet Lake PCH

Ubuntu 20.04 - 5.9.0-050900-generic - GNOME Shell 3.36.4

8 Systems - 360 Benchmark Results

Intel Core i9-10980XE - Gigabyte X299X DESIGNARE 10G - Intel Sky Lake-E DMI3 Registers

Ubuntu 19.10 - 5.4.0-999-generic - GNOME Shell 3.34.1

15 Systems - 38 Benchmark Results

2 x Intel Xeon Gold 6258R - Supermicro X11DAi-N v1.10 - Intel Sky Lake-E DMI3 Registers

Fedora 32 - 5.6.14-300.fc32.x86_64 - GNOME Shell 3.36.2

3 Systems - 301 Benchmark Results

Intel Core i5-10600K - ASUS PRIME Z490M-PLUS - Intel Comet Lake PCH

Ubuntu 20.04 - 5.4.0-40-generic - GNOME Shell 3.36.3

18 Systems - 115 Benchmark Results

AMD Ryzen Threadripper 3970X 32-Core - ASUS ROG ZENITH II EXTREME - AMD Starship

Fedora 30 - 5.2.6-200.fc30.x86_64 - GNOME Shell 3.32.2

7 Systems - 62 Benchmark Results

Intel Core i9-7960X - 15360MB - 2 x 275GB Virtual Disk

Ubuntu 18.04 - 4.19.75-microsoft-standard - GCC 7.4.0

12 Systems - 229 Benchmark Results

AMD Ryzen 7 3800XT 8-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship

Ubuntu 20.04 - 5.9.0-050900-generic - GNOME Shell 3.36.4

2 Systems - 475 Benchmark Results

AMD Ryzen Threadripper 3970X 32-Core - ASUS ROG ZENITH II EXTREME - AMD Starship

Ubuntu 19.10 - 5.3.0-40-generic - GNOME Shell 3.34.1

Find More Test Results