ACES DGEMM

This is a multi-threaded DGEMM benchmark.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark mt-dgemm.

Project Site

lanl.gov

Test Created

11 October 2019

Last Updated

11 October 2019

Test Maintainer

Michael Larabel 

Test Type

Processor

Average Install Time

1 Second

Average Run Time

16 Minutes, 29 Seconds

Test Dependencies

C/C++ Compiler Toolchain

Accolades

5k+ Downloads

Supported Platforms


Public Result UploadsReported Installs*Test Completions*OpenBenchmarking.orgEventsACES DGEMM Popularity Statisticspts/mt-dgemm2019.102019.112019.122020.012020.022020.032020.042020.052020.062020.072020.082020.092020.102020.112020.122021.012021.022004006008001000
* Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
Data current as of Fri, 26 Feb 2021 20:12:26 GMT.

Revision History

pts/mt-dgemm-1.2.0   [View Source]   Fri, 11 Oct 2019 15:29:15 GMT
Initial commit of ACES DGEMM

Suites Using This Test

Multi-Core

Scientific Computing

HPC - High Performance Computing

Linear Algebra

Programmer / Developer System Benchmarks


Performance Metrics

Analyze Test Configuration:

ACES DGEMM 1.0

Sustained Floating-Point Rate

OpenBenchmarking.org metrics for this test profile configuration based on 1,007 public results since 11 October 2019 with the latest data as of 24 February 2021.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component
Percentile Rank
# Matching Public Results
GFLOP/s (Average)
100th
22
27.3 +/- 3.5
99th
5
20.5 +/- 0.3
96th
11
19.8 +/- 0.7
94th
6
17.5 +/- 1.0
92nd
9
16.7 +/- 0.2
91st
4
16.2 +/- 0.3
90th
9
15.4 +/- 0.3
88th
30
13.9 +/- 0.2
86th
7
13.6 +/- 0.4
86th
7
13.4 +/- 0.4
86th
4
13.3 +/- 1.6
85th
8
12.7 +/- 0.1
84th
6
12.6 +/- 0.5
83rd
7
12.2 +/- 0.1
82nd
6
12.1 +/- 0.6
81st
6
11.7 +/- 0.8
79th
6
10.2 +/- 0.6
79th
6
10.2 +/- 1.2
78th
8
9.8 +/- 0.1
76th
8
9.4 +/- 0.4
Mid-Tier
75th
< 9.4
73rd
11
9.2 +/- 0.4
73rd
6
9.2 +/- 0.1
70th
7
8.9 +/- 0.1
70th
39
8.7 +/- 0.8
68th
3
7.9 +/- 0.1
66th
8
7.6 +/- 0.2
62nd
12
7.2 +/- 0.1
62nd
3
7.2 +/- 0.2
61st
9
6.8 +/- 0.3
59th
3
6.6 +/- 0.2
59th
3
6.5 +/- 0.2
57th
24
6.3 +/- 0.2
55th
9
5.9 +/- 0.6
53rd
6
5.7 +/- 0.4
53rd
15
5.5 +/- 0.1
Median
50th
5.3
50th
11
5.2 +/- 0.2
49th
3
5.0 +/- 0.3
48th
3
5.0 +/- 0.1
46th
13
4.7 +/- 0.6
44th
6
4.4 +/- 0.3
44th
6
4.4 +/- 0.1
41st
5
4.2 +/- 0.2
39th
29
4.1 +/- 0.5
37th
10
3.8 +/- 0.3
36th
5
3.6 +/- 0.4
35th
6
3.5 +/- 0.1
32nd
9
3.0 +/- 0.1
28th
9
2.6 +/- 0.2
27th
4
2.5 +/- 0.1
Low-Tier
25th
< 2.3
25th
10
2.3 +/- 0.1
25th
6
2.3 +/- 0.1
23rd
5
2.1 +/- 0.1
19th
15
1.6 +/- 0.1
16th
7
1.3 +/- 0.2
OpenBenchmarking.orgDistribution Of Public Results - Sustained Floating-Point Rate1007 Results Range From 0 To 30 GFLOP/s036912151821242730333670140210280350

Based on OpenBenchmarking.org data, the selected test / test configuration (ACES DGEMM 1.0 - Sustained Floating-Point Rate) has an average run-time of 7 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

OpenBenchmarking.orgMinutesTime Required To Complete BenchmarkSustained Floating-Point RateRun-Time1326395265Min: 1 / Avg: 6.31 / Max: 64

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 1.4%.

OpenBenchmarking.orgPercent, Fewer Is BetterAverage Deviation Between RunsSustained Floating-Point RateDeviation246810Min: 0 / Avg: 1.39 / Max: 6

Does It Scale Well With Increasing Cores?

Yes, based on the automated analysis of the collected public benchmark data, this test / test settings does generally scale well with increasing CPU core counts. Data based on publicly available results for this test / test settings, separated by vendor, result divided by the reference CPU clock speed, grouped by matching physical CPU core count, and normalized against the smallest core count tested from each vendor for each CPU having a sufficient number of test samples and statistically significant data.

IntelAMDOpenBenchmarking.orgRelative Core Scaling To BaseACES DGEMM CPU Core ScalingSustained Floating-Point Rate4681216202432486448121620

Notable Instruction Set Usage

Notable instruction set extensions supported by this test, based on an automatic analysis by the Phoronix Test Suite / OpenBenchmarking.org analytics engine.

Instruction Set
Support
Instructions Detected
Used by default on supported hardware.
Found on Intel processors since Sandy Bridge (2011).
Found on AMD processors since Bulldozer (2011).

 
VZEROUPPER VEXTRACTF128 VINSERTF128 VBROADCASTSD
FMA (FMA)
Used by default on supported hardware.
Found on Intel processors since Haswell (2013).
Found on AMD processors since Bulldozer (2011).

 
VFMADD132SD VFMADD231SD
The test / benchmark does honor compiler flag changes.
Last automated analysis: 30 January 2021

This test profile binary relies on the shared libraries libgomp.so.1, libpthread.so.0, libc.so.6, libdl.so.2.

Recent Test Results

OpenBenchmarking.org Results Compare

1 System - 53 Benchmark Results

AMD Ryzen 5 3600 6-Core - ASUS PRIME X470-PRO - AMD Starship

Arch rolling - 5.10.16-zen1-1-zen - X Server 1.20.10

1 System - 358 Benchmark Results

Ampere Altra ARMv8 Neoverse-N1 - WIWYNN Mt.Jade - Ampere Computing LLC Device e100

Ubuntu 20.04 - 5.11.0-051100-generic-64k - GNOME Shell 3.36.4

1 System - 37 Benchmark Results

12 x AMD EPYC-Rome - QEMU Standard PC - Intel 82G33

Ubuntu 20.04 - 5.8.0-43-generic - GNOME Shell 3.36.4

1 System - 2393 Benchmark Results

AMD Ryzen 7 PRO 4750G - ASRock A520M-ITX/ac - AMD Renoir Root Complex

Gentoo - 5.10.16 - amd

1 System - 2388 Benchmark Results

AMD Ryzen 7 PRO 4750G - ASRock A520M-ITX/ac - AMD Renoir Root Complex

Gentoo - 5.10.16 - amd

1 System - 2326 Benchmark Results

AMD Ryzen 7 PRO 4750G - ASRock A520M-ITX/ac - AMD Renoir Root Complex

Gentoo - 5.10.15 - GCC 10.2.0 + Clang 11.0.0 + LLVM 11.0.0

1 System - 158 Benchmark Results

AMD Ryzen 7 3700X 8-Core - MSI A520M-A PRO - AMD Starship

Fedora 33 - 5.10.14-200.fc33.x86_64 - Clang 11.0.0

1 System - 2276 Benchmark Results

1 System - 167 Benchmark Results

AMD Ryzen Threadripper 2950X 16-Core - ASRock X399 Professional Gaming - AMD 17h

Ubuntu 16.04 - 4.19.174-custom - X Server 1.19.6

1 System - 143 Benchmark Results

2 x Intel Xeon Platinum 8280 - GIGABYTE MD61-SC2-00 v01000100 - Intel Sky Lake-E DMI3 Registers

Ubuntu 19.04 - 5.0.0-38-generic - GNOME Shell 3.32.2

4 Systems - 198 Benchmark Results

Intel Xeon E-2278GEL - Logic Supply RXM-181 TBD by OEM - Intel

FreeBSD - 12.2-RELEASE - Clang 10.0.1

1 System - 2 Benchmark Results

Intel Xeon E3-1225 v5 - PFU LIMITED MBE-561A - Intel Xeon E3-1200 v5

CentOS Linux 7 - 3.10.0-957.el7.x86_64 - GNOME Shell 3.28.3

Most Popular Test Results

OpenBenchmarking.org Results Compare

3 Systems - 268 Benchmark Results

Intel Core i5-2520M - HP 161C - Intel 2nd Generation Core DRAM

Ubuntu 18.04 - 4.18.0-20-generic - GNOME Shell 3.28.3

12 Systems - 593 Benchmark Results

AMD Ryzen 9 3900XT 12-Core - MSI MEG X570 GODLIKE - AMD Starship

Ubuntu 20.04 - 5.8.0-050800daily20200622-generic - GNOME Shell 3.36.2

11 Systems - 217 Benchmark Results

AMD Ryzen 9 3900XT 12-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship

Ubuntu 20.04 - 5.9.0-050900-generic - GNOME Shell 3.36.4

8 Systems - 360 Benchmark Results

AMD Ryzen Threadripper 3960X 24-Core - MSI Creator TRX40 - AMD Starship

Ubuntu 19.10 - 5.4.0-999-generic - GNOME Shell 3.34.1

15 Systems - 38 Benchmark Results

2 x Intel Xeon Platinum 8124M - GIGABYTE MR91-FS0-00 v01000100 - Intel Sky Lake-E DMI3 Registers

Fedora 32 - 5.6.14-300.fc32.x86_64 - GNOME Shell 3.36.2

3 Systems - 301 Benchmark Results

Intel Core i5-4670 - MSI B85M-P33 - Intel 4th Gen Core DRAM

Ubuntu 20.04 - 5.4.0-40-generic - GNOME Shell 3.36.3

18 Systems - 115 Benchmark Results

AMD Ryzen Threadripper 3970X 32-Core - ASUS ROG ZENITH II EXTREME - AMD Starship

Fedora 30 - 5.3.8-200.local.fc30.x86_64 - GNOME Shell 3.32.2

7 Systems - 62 Benchmark Results

Intel Core i9-7960X - 16384MB - 238GB

Ubuntu 18.04 - 4.4.0-18362-Microsoft - GCC 7.4.0

2 Systems - 475 Benchmark Results

AMD Ryzen Threadripper 3970X 32-Core - ASUS ROG ZENITH II EXTREME - AMD Starship

Ubuntu 19.10 - 5.3.0-18-generic - GNOME Shell 3.34.1

Find More Test Results

OpenBenchmarking.org Community User Comments

Post A Comment