ACES DGEMM

This is a multi-threaded DGEMM benchmark.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark mt-dgemm.

Project Site

lanl.gov

Test Created

11 October 2019

Last Updated

11 October 2019

Test Maintainer

Michael Larabel 

Test Type

Processor

Average Install Time

1 Second

Average Run Time

16 Minutes, 41 Seconds

Test Dependencies

C/C++ Compiler Toolchain

Accolades

5k+ Downloads

Supported Platforms


Public Result UploadsReported Installs*Test Completions*OpenBenchmarking.orgEventsACES DGEMM Popularity Statisticspts/mt-dgemm2019.102019.112019.122020.012020.022020.032020.042020.052020.062020.072020.082020.092020.102020.112020.122021.012021.022021.032004006008001000
* Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
Data current as of Wed, 03 Mar 2021 06:40:35 GMT.

Revision History

pts/mt-dgemm-1.2.0   [View Source]   Fri, 11 Oct 2019 15:29:15 GMT
Initial commit of ACES DGEMM

Suites Using This Test

Multi-Core

Scientific Computing

HPC - High Performance Computing

Linear Algebra

Programmer / Developer System Benchmarks


Performance Metrics

Analyze Test Configuration:

ACES DGEMM 1.0

Sustained Floating-Point Rate

OpenBenchmarking.org metrics for this test profile configuration based on 1,035 public results since 11 October 2019 with the latest data as of 3 March 2021.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component
Percentile Rank
# Matching Public Results
GFLOP/s (Average)
100th
23
27.3 +/- 3.4
99th
5
20.5 +/- 0.3
96th
11
19.8 +/- 0.7
94th
6
17.5 +/- 1.0
92nd
10
16.7 +/- 0.2
91st
5
16.2 +/- 0.3
90th
10
15.4 +/- 0.3
88th
31
13.9 +/- 0.2
86th
7
13.6 +/- 0.4
86th
7
13.4 +/- 0.4
85th
4
13.3 +/- 1.6
84th
6
12.6 +/- 0.5
83rd
7
12.2 +/- 0.1
82nd
6
12.1 +/- 0.6
81st
6
11.7 +/- 0.8
79th
3
10.5 +/- 0.4
79th
7
10.4 +/- 1.2
79th
6
10.2 +/- 0.6
78th
9
9.8 +/- 0.1
Mid-Tier
75th
< 9.4
75th
8
9.4 +/- 0.4
73rd
12
9.2 +/- 0.4
72nd
7
9.1 +/- 0.2
70th
7
8.9 +/- 0.1
69th
39
8.7 +/- 0.8
67th
3
7.9 +/- 0.1
66th
8
7.6 +/- 0.2
62nd
13
7.2 +/- 0.1
62nd
3
7.2 +/- 0.2
61st
9
6.8 +/- 0.3
59th
3
6.6 +/- 0.2
59th
3
6.5 +/- 0.2
57th
24
6.3 +/- 0.2
55th
9
5.9 +/- 0.6
53rd
6
5.7 +/- 0.4
53rd
15
5.5 +/- 0.1
Median
50th
5.3
50th
11
5.2 +/- 0.2
49th
3
5.0 +/- 0.3
48th
3
5.0 +/- 0.1
46th
14
4.7 +/- 0.5
44th
6
4.4 +/- 0.3
44th
6
4.4 +/- 0.1
41st
5
4.2 +/- 0.2
39th
29
4.1 +/- 0.5
37th
10
3.8 +/- 0.3
36th
5
3.6 +/- 0.4
35th
7
3.5 +/- 0.1
32nd
10
3.0 +/- 0.1
28th
9
2.6 +/- 0.2
28th
4
2.5 +/- 0.1
Low-Tier
25th
< 2.3
25th
10
2.3 +/- 0.1
25th
6
2.3 +/- 0.1
24th
5
2.1 +/- 0.1
19th
16
1.6 +/- 0.1
16th
7
1.3 +/- 0.2
OpenBenchmarking.orgDistribution Of Public Results - Sustained Floating-Point Rate1035 Results Range From 0 To 30 GFLOP/s036912151821242730333670140210280350

Based on OpenBenchmarking.org data, the selected test / test configuration (ACES DGEMM 1.0 - Sustained Floating-Point Rate) has an average run-time of 7 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

OpenBenchmarking.orgMinutesTime Required To Complete BenchmarkSustained Floating-Point RateRun-Time1326395265Min: 1 / Avg: 6.67 / Max: 64

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 1.4%.

OpenBenchmarking.orgPercent, Fewer Is BetterAverage Deviation Between RunsSustained Floating-Point RateDeviation246810Min: 0 / Avg: 1.37 / Max: 6

Does It Scale Well With Increasing Cores?

Yes, based on the automated analysis of the collected public benchmark data, this test / test settings does generally scale well with increasing CPU core counts. Data based on publicly available results for this test / test settings, separated by vendor, result divided by the reference CPU clock speed, grouped by matching physical CPU core count, and normalized against the smallest core count tested from each vendor for each CPU having a sufficient number of test samples and statistically significant data.

IntelAMDOpenBenchmarking.orgRelative Core Scaling To BaseACES DGEMM CPU Core ScalingSustained Floating-Point Rate468121620243248643691215

Notable Instruction Set Usage

Notable instruction set extensions supported by this test, based on an automatic analysis by the Phoronix Test Suite / OpenBenchmarking.org analytics engine.

Instruction Set
Support
Instructions Detected
Used by default on supported hardware.
Found on Intel processors since Sandy Bridge (2011).
Found on AMD processors since Bulldozer (2011).

 
VZEROUPPER VEXTRACTF128 VINSERTF128 VBROADCASTSD
FMA (FMA)
Used by default on supported hardware.
Found on Intel processors since Haswell (2013).
Found on AMD processors since Bulldozer (2011).

 
VFMADD132SD VFMADD231SD
The test / benchmark does honor compiler flag changes.
Last automated analysis: 30 January 2021

This test profile binary relies on the shared libraries libgomp.so.1, libpthread.so.0, libc.so.6, libdl.so.2.

Recent Test Results

OpenBenchmarking.org Results Compare

1 System - 7 Benchmark Results

Intel Core i5-10210U - Intel NUC10i5FNB - Intel Device 02ef

Ubuntu 20.04 - 5.4.0-65-generic - Xfce 4.14

1 System - 6 Benchmark Results

Intel Core i5-10210U - Intel NUC10i5FNB - Intel Device 02ef

Ubuntu 20.04 - 5.4.0-65-generic - Xfce 4.14

1 System - 4 Benchmark Results

Intel Core i5-10210U - Intel NUC10i5FNB - Intel Device 02ef

Ubuntu 20.04 - 5.4.0-65-generic - Xfce 4.14

1 System - 7 Benchmark Results

Intel Core i5-10210U - Intel NUC10i5FNB - Intel Device 02ef

Ubuntu 20.04 - 5.4.0-65-generic - Xfce 4.14

1 System - 5 Benchmark Results

Intel Core i5-10210U - Intel NUC10i5FNB - Intel Device 02ef

Ubuntu 20.04 - 5.4.0-65-generic - Xfce 4.14

1 System - 4 Benchmark Results

Intel Core i5-10210U - Intel NUC10i5FNB - Intel Device 02ef

Ubuntu 20.04 - 5.4.0-65-generic - Xfce 4.14

1 System - 6 Benchmark Results

Intel Core i5-10210U - Intel NUC10i5FNB - Intel Device 02ef

Ubuntu 20.04 - 5.4.0-65-generic - Xfce 4.14

1 System - 7 Benchmark Results

Intel Core i5-10210U - Intel NUC10i5FNB - Intel Device 02ef

Ubuntu 20.04 - 5.4.0-65-generic - Xfce 4.14

1 System - 7 Benchmark Results

Intel Core i5-10210U - Intel NUC10i5FNB - Intel Device 02ef

Ubuntu 20.04 - 5.4.0-65-generic - Xfce 4.14

1 System - 53 Benchmark Results

AMD Ryzen 5 3600 6-Core - ASUS PRIME X470-PRO - AMD Starship

Arch rolling - 5.10.16-zen1-1-zen - X Server 1.20.10

1 System - 358 Benchmark Results

Ampere Altra ARMv8 Neoverse-N1 - WIWYNN Mt.Jade - Ampere Computing LLC Device e100

Ubuntu 20.04 - 5.11.0-051100-generic-64k - GNOME Shell 3.36.4

1 System - 37 Benchmark Results

12 x AMD EPYC-Rome - QEMU Standard PC - Intel 82G33

Ubuntu 20.04 - 5.8.0-43-generic - GNOME Shell 3.36.4

Most Popular Test Results

OpenBenchmarking.org Results Compare

3 Systems - 268 Benchmark Results

Intel Core i5-2520M - HP 161C - Intel 2nd Generation Core DRAM

Ubuntu 18.04 - 4.18.0-20-generic - GNOME Shell 3.28.3

12 Systems - 593 Benchmark Results

AMD Ryzen 7 5800X 8-Core - Gigabyte X570 AORUS MASTER - AMD Starship

Fedora 33 - 5.8.16-300.fc33.x86_64 - GNOME Shell 3.38.1

11 Systems - 217 Benchmark Results

AMD Ryzen 9 3900X 12-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship

Ubuntu 20.04 - 5.9.0-050900-generic - GNOME Shell 3.36.4

8 Systems - 360 Benchmark Results

AMD Ryzen 9 3900X 12-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship

Ubuntu 19.10 - 5.4.0-999-generic - GNOME Shell 3.34.1

15 Systems - 38 Benchmark Results

2 x Intel Xeon Gold 6258R - Supermicro X11DAi-N v1.10 - Intel Sky Lake-E DMI3 Registers

Fedora 32 - 5.6.14-300.fc32.x86_64 - GNOME Shell 3.36.2

3 Systems - 301 Benchmark Results

Intel Core i5-7600K - Gigabyte Z270M-D3H-CF - Intel Xeon E3-1200 v6

Ubuntu 20.04 - 5.4.0-40-generic - GNOME Shell 3.36.3

18 Systems - 115 Benchmark Results

Intel Core i9-7960X - MSI X299 SLI PLUS - Intel Sky Lake-E DMI3 Registers

Ubuntu 19.10 - 5.3.0-18-generic - GNOME Shell 3.34.1

7 Systems - 62 Benchmark Results

Intel Core i9-7960X - MSI X299 SLI PLUS - Intel Sky Lake-E DMI3 Registers

Ubuntu 18.04 - 5.0.0-32-generic - GNOME Shell 3.28.4

12 Systems - 229 Benchmark Results

Intel Core i5-10600K - Gigabyte Z490 AORUS MASTER - Intel Comet Lake PCH

Ubuntu 20.04 - 5.9.0-050900-generic - GNOME Shell 3.36.4

Find More Test Results