NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark npb.

Project Site

nas.nasa.gov

Test Created

6 December 2010

Last Updated

22 May 2021

Test Maintainer

Michael Larabel 

Test Type

Processor

Average Install Time

30 Seconds

Average Run Time

2 Minutes, 34 Seconds

Test Dependencies

C/C++ Compiler Toolchain + Fortran + OpenMPI

Accolades

250k+ Downloads + 5k+ Public Benchmark Results

Supported Platforms


Public Result Uploads *Reported Test Completions **Reported Installs **Test Profile Page Views ***OpenBenchmarking.orgEventsNAS Parallel Benchmarks Popularity Statisticspts/npb2010.122011.042011.082011.122012.042012.082012.122013.042013.082013.122014.042014.082014.122015.042015.082015.122016.042016.082016.122017.042017.082017.122018.042018.082018.122019.042019.082019.122020.042020.082020.122021.042021.0820K40K60K80K100K
* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
*** Test profile page view reporting began March 2021.
Data current as of 15 September 2021.
EP.C13.7%BT.C11.7%CG.C9.3%LU.C15.4%FT.C11.9%MG.C12.2%SP.B10.8%EP.D15.0%Test / Class Option PopularityOpenBenchmarking.org

Revision History

pts/npb-1.4.4   [View Source]   Sat, 22 May 2021 17:40:59 GMT
Add sp.c, adjust process count per https://www.phoronix.com/forums/forum/phoronix/phoronix-test-suite/1257554-how-to-enable-verbose-output#post1257618

pts/npb-1.4.2   [View Source]   Thu, 21 Jan 2021 12:22:20 GMT
Build fix for GCC 10 gfortran/

pts/npb-1.4.1   [View Source]   Sat, 07 Mar 2020 12:22:08 GMT
Add --alow-run-as-root, ended up being missed until now.

pts/npb-1.4.0   [View Source]   Wed, 28 Aug 2019 11:43:04 GMT
Update against upstream NPB 3.4, add new test cases.

pts/npb-1.3.1   [View Source]   Tue, 15 Jan 2019 11:37:52 GMT
Update against upstream NPB 3.1.1

pts/npb-1.3.0   [View Source]   Fri, 09 Nov 2018 12:39:01 GMT
Use physical core count rather than logical cores to avoid MPI breaking on high core count systems with SMT.

pts/npb-1.2.4   [View Source]   Sat, 09 Sep 2017 10:30:50 GMT
ft.B not ft.C size

pts/npb-1.2.3   [View Source]   Sat, 09 Sep 2017 10:15:47 GMT
Update MPI handling, add some larger test sizes

pts/npb-1.2.2   [View Source]   Thu, 25 Aug 2016 13:51:56 GMT
Update CFLAGS handling

pts/npb-1.2.1   [View Source]   Thu, 27 Nov 2014 18:42:33 GMT
MG.B and IS.D do not build / run properly on modern systems.

pts/npb-1.2.0   [View Source]   Tue, 05 Nov 2013 22:57:21 GMT
Working on new version of NPB test that makes use of some of the HPCC test profile MPI improvements and other multi-core/cluster work for better benchmarking. Seems to have MPI rank errors right now though for NPB on this build.

pts/npb-1.1.1   [View Source]   Sun, 10 Jun 2012 19:11:43 GMT
Remove tests not in MPI NPB version.

pts/npb-1.1.0   [View Source]   Sun, 10 Jun 2012 16:56:25 GMT
Switch to the MPI version of NPB benchmarks.

pts/npb-1.0.0   [View Source]   Mon, 06 Dec 2010 15:00:08 GMT
Initial import into OpenBenchmarking.org

Suites Using This Test

Multi-Core

CPU Massive

Server CPU Tests

HPC - High Performance Computing

MPI Benchmarks


Performance Metrics

Analyze Test Configuration:

NAS Parallel Benchmarks 3.4

Test / Class: BT.C

OpenBenchmarking.org metrics for this test profile configuration based on 1,118 public results since 28 August 2019 with the latest data as of 20 September 2021.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component
Percentile Rank
# Compatible Public Results
Total Mop/s (Average)
100th
12
256326 +/- 2718
99th
22
241333 +/- 4721
98th
6
235747 +/- 5760
97th
4
204335 +/- 2832
96th
24
194248 +/- 2884
93rd
7
153163 +/- 12884
93rd
4
143155 +/- 2972
92nd
5
138341 +/- 624
92nd
4
138314 +/- 136
90th
5
135658 +/- 561
90th
22
135251 +/- 3412
89th
5
128929 +/- 555
88th
3
126767 +/- 1206
88th
7
125205 +/- 976
86th
11
117230 +/- 159
86th
4
116742 +/- 17159
85th
7
111859 +/- 11773
85th
6
108709 +/- 577
83rd
9
96162 +/- 14031
83rd
12
96135 +/- 3953
80th
4
84740 +/- 11294
79th
4
82808 +/- 2834
79th
4
78969 +/- 1208
78th
3
77911 +/- 763
Mid-Tier
75th
< 74816
74th
4
72730 +/- 1046
74th
8
72484 +/- 8054
73rd
3
68583 +/- 8082
69th
4
66106 +/- 1049
67th
5
60196 +/- 1290
66th
5
59182 +/- 8275
66th
5
58005 +/- 8038
64th
4
54289 +/- 5506
64th
14
53641 +/- 3706
62nd
3
51018 +/- 399
59th
3
45978 +/- 4433
57th
10
44401 +/- 853
56th
20
43109 +/- 6126
Median
50th
29988
49th
5
27907 +/- 2061
48th
10
26303 +/- 163
47th
36
25376 +/- 1713
44th
4
24456 +/- 16
43rd
26
24293 +/- 588
42nd
9
23619 +/- 1573
41st
3
23457 +/- 63
40th
14
22635 +/- 212
39th
3
21279 +/- 2628
38th
3
20408 +/- 193
38th
6
20085 +/- 15
36th
4
19195 +/- 74
35th
7
19176 +/- 260
35th
9
19110 +/- 374
35th
3
19057 +/- 172
34th
5
18811 +/- 62
34th
3
18792 +/- 57
32nd
3
18556 +/- 32
32nd
3
18415 +/- 1490
32nd
5
18271 +/- 579
32nd
4
18259 +/- 1285
31st
10
18215 +/- 2233
31st
4
17893 +/- 278
31st
6
17531 +/- 1121
29th
4
16537 +/- 1116
29th
12
16356 +/- 207
27th
3
16123 +/- 41
27th
3
16060 +/- 864
27th
4
15967 +/- 38
26th
12
15620 +/- 1244
Low-Tier
25th
< 15542
25th
6
15428 +/- 103
25th
4
15320 +/- 40
23rd
4
14827 +/- 661
22nd
5
14397 +/- 475
21st
4
14195 +/- 698
21st
9
14144 +/- 51
21st
16
14142 +/- 168
19th
5
14058 +/- 1152
18th
4
13516 +/- 23
18th
3
13363 +/- 9
17th
7
13003 +/- 156
17th
9
12888 +/- 112
16th
4
12672 +/- 57
14th
6
11963 +/- 592
14th
3
11872 +/- 589
14th
3
11568 +/- 533
14th
3
11507 +/- 151
14th
4
11436 +/- 1494
13th
7
11066 +/- 214
12th
3
10313 +/- 21
11th
9
10260 +/- 197
11th
3
9592 +/- 15
10th
4
9499 +/- 40
9th
4
8790 +/- 290
8th
8
7625 +/- 177
7th
3
6384 +/- 91
6th
4
4108 +/- 8
OpenBenchmarking.orgDistribution Of Public Results - Test / Class: BT.C1112 Results Range From 90 To 259688 Total Mop/s905282104741566620858260503124236434416264681852010572026239467586727787797083162883549354698738103930109122114314119506124698129890135082140274145466150658155850161042166234171426176618181810187002192194197386202578207770212962218154223346228538233730238922244114249306254498259690306090120150

Based on OpenBenchmarking.org data, the selected test / test configuration (NAS Parallel Benchmarks 3.4 - Test / Class: BT.C) has an average run-time of 9 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

OpenBenchmarking.orgMinutesTime Required To Complete BenchmarkTest / Class: BT.CRun-Time4080120160200Min: 1 / Avg: 8.63 / Max: 207

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 0.2%.

OpenBenchmarking.orgPercent, Fewer Is BetterAverage Deviation Between RunsTest / Class: BT.CDeviation246810Min: 0 / Avg: 0.19 / Max: 3

Does It Scale Well With Increasing Cores?

Yes, based on the automated analysis of the collected public benchmark data, this test / test settings does generally scale well with increasing CPU core counts. Data based on publicly available results for this test / test settings, separated by vendor, result divided by the reference CPU clock speed, grouped by matching physical CPU core count, and normalized against the smallest core count tested from each vendor for each CPU having a sufficient number of test samples and statistically significant data.

AMDIntelOpenBenchmarking.orgRelative Core Scaling To BaseNAS Parallel Benchmarks CPU Core ScalingTest / Class: BT.C468121624324864128816243240

Recent Test Results

OpenBenchmarking.org Results Compare

2 Systems - 52 Benchmark Results

ARMv8 rev 0 - e3360_1099 - 32GB

Ubuntu 20.04 - 5.10.41-tegra - X Server

2 Systems - 51 Benchmark Results

ARMv8 rev 0 - e3360_1099 - 32GB

Ubuntu 20.04 - 5.10.41-tegra - X Server

2 Systems - 50 Benchmark Results

ARMv8 rev 0 - e3360_1099 - 32GB

Ubuntu 20.04 - 5.10.41-tegra - X Server

2 Systems - 49 Benchmark Results

ARMv8 rev 0 - e3360_1099 - 32GB

Ubuntu 20.04 - 5.10.41-tegra - X Server

3 Systems - 192 Benchmark Results

2 x Intel Xeon Gold 5220R - TYAN S7106 - Intel Sky Lake-E DMI3 Registers

Ubuntu 20.04 - 5.9.0-050900rc6-generic - GNOME Shell 3.36.4

2 Systems - 48 Benchmark Results

ARMv8 rev 0 - e3360_1099 - 32GB

Ubuntu 20.04 - 5.10.41-tegra - X Server

2 Systems - 47 Benchmark Results

ARMv8 rev 0 - e3360_1099 - 32GB

Ubuntu 20.04 - 5.10.41-tegra - X Server

2 Systems - 46 Benchmark Results

ARMv8 rev 0 - e3360_1099 - 32GB

Ubuntu 20.04 - 5.10.41-tegra - X Server

2 Systems - 140 Benchmark Results

AMD Ryzen 9 5900X 12-Core - 8GB - 2 x 275GB Virtual Disk

Ubuntu 20.04 - 5.10.16.3-microsoft-standard-WSL2 - Wayland

2 Systems - 45 Benchmark Results

ARMv8 rev 0 - e3360_1099 - 32GB

Ubuntu 20.04 - 5.10.41-tegra - X Server

2 Systems - 44 Benchmark Results

ARMv8 rev 0 - e3360_1099 - 32GB

Ubuntu 20.04 - 5.10.41-tegra - X Server

2 Systems - 43 Benchmark Results

ARMv8 rev 0 - e3360_1099 - 32GB

Ubuntu 20.04 - 5.10.41-tegra - X Server

2 Systems - 39 Benchmark Results

ARMv8 rev 0 - e3360_1099 - 32GB

Ubuntu 20.04 - 5.10.41-tegra - X Server

2 Systems - 38 Benchmark Results

ARMv8 rev 0 - e3360_1099 - 32GB

Ubuntu 20.04 - 5.10.41-tegra - X Server

2 Systems - 34 Benchmark Results

ARMv8 rev 0 - e3360_1099 - 32GB

Ubuntu 20.04 - 5.10.41-tegra - X Server

Most Popular Test Results

OpenBenchmarking.org Results Compare

16 Systems - 119 Benchmark Results

2 x Intel Xeon Platinum 8259L - ASRockRack EP2C621D16-4LP - Intel Sky Lake-E DMI3 Registers

Ubuntu 19.10 - 5.3.0-64-generic - GNOME Shell 3.34.1

3 Systems - 9 Benchmark Results

2 x Intel Xeon Platinum 8280 - GIGABYTE MD61-SC2-00 v01000100 - Intel Sky Lake-E DMI3 Registers

Ubuntu 19.04 - 5.3.0-999-generic - GNOME Shell 3.32.2

3 Systems - 173 Benchmark Results

AMD Ryzen Threadripper 3970X 32-Core - 52GB - 2 x 275GB Virtual Disk

Ubuntu 20.04 - 4.19.104-microsoft-standard - X Server

3 Systems - 301 Benchmark Results

Intel Core i5-4670 - MSI B85M-P33 - Intel 4th Gen Core DRAM

Ubuntu 20.04 - 5.4.0-40-generic - GNOME Shell 3.36.3

2 Systems - 150 Benchmark Results

AMD Ryzen 7 4700U - LENOVO LNVNB161216 - AMD Renoir Root Complex

Ubuntu 20.04 - 5.7.0-999-generic - GNOME Shell 3.36.1

2 Systems - 269 Benchmark Results

AMD Ryzen Threadripper 3990X 64-Core - System76 Thelio Major - AMD Starship

Pop 20.04 - 5.4.0-7626-generic - GNOME Shell 3.36.1

3 Systems - 143 Benchmark Results

AMD EPYC 7742 64-Core - AMD DAYTONA_X - AMD Starship

Ubuntu 20.04 - 5.4.0-31-generic - GNOME Shell 3.36.1

3 Systems - 108 Benchmark Results

Intel Core i7-3770K - ECS Z77H2-A2X v1.0 - Intel Xeon E3-1200 v2

Ubuntu 20.04 - 5.4.0-58-generic - GNOME Shell 3.36.4

2 Systems - 59 Benchmark Results

AMD Ryzen 7 3700X 8-Core - MSI MEG X570 GODLIKE - AMD Device 1480

Clear Linux OS 31480 - 5.3.8-854.native - GNOME Shell 3.34.1

2 Systems - 1708 Benchmark Results

Intel Core i3-10100 - ASUS PRIME Z490M-PLUS - Intel Comet Lake PCH

Ubuntu 20.04 - 5.7.0-rc6-amd-energy - GNOME Shell 3.36.2

6 Systems - 76 Benchmark Results

Intel Core i7-5600U - LENOVO 20BSCTO1WW - Intel Broadwell-U-OPI

Ubuntu 19.10 - 5.3.0-19-generic - GNOME Shell 3.34.1

11 Systems - 72 Benchmark Results

Intel Core i9-10980XE - Gigabyte X299X DESIGNARE 10G - Intel Sky Lake-E DMI3 Registers

Debian 10 - 4.19.0-6-amd64 - GNOME Shell 3.30.2

Find More Test Results