CUDA Mini-Nbody

The CUDA version of Harrism's mini-nbody tests.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark cuda-mini-nbody.

Project Site

github.com

Test Created

10 November 2015

Last Updated

15 March 2019

Test Maintainer

Michael Larabel 

Test Type

Graphics

Average Install Time

2 Seconds

Average Run Time

38 Seconds

Test Dependencies

C/C++ Compiler Toolchain

Accolades

90k+ Downloads

Supported Platforms


Public Result Uploads *Reported Installs **Reported Test Completions **Test Profile Page Views ***OpenBenchmarking.orgEventsCUDA Mini-Nbody Popularity Statisticspts/cuda-mini-nbody2015.112016.022016.052016.082016.112017.022017.052017.082017.112018.022018.052018.082018.112019.022019.052019.082019.112020.022020.052020.082020.112021.022021.052021.082021.112022.022022.052022.082022.1120K40K60K80K100K
* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
*** Test profile page view reporting began March 2021.
Data current as of 5 December 2022.
Cache Blocking18.4%Flush Denormals To Zero17.8%SOA Data Layout18.1%Original27.1%Loop Unrolling18.6%Test Option PopularityOpenBenchmarking.org

Revision History

pts/cuda-mini-nbody-1.1.1   [View Source]   Fri, 15 Mar 2019 16:35:43 GMT
Fix MIB to HIB proportion.

pts/cuda-mini-nbody-1.1.0   [View Source]   Fri, 07 Dec 2018 14:52:29 GMT
Update https://github.com/phoronix-test-suite/test-profiles/pull/22

pts/cuda-mini-nbody-1.0.1   [View Source]   Sat, 11 Jun 2016 12:14:06 GMT
Set /usr/local/cuda/bin in PATH when needed

pts/cuda-mini-nbody-1.0.0   [View Source]   Tue, 10 Nov 2015 13:41:29 GMT
CUDA mini Nbody


Performance Metrics

Analyze Test Configuration:

CUDA Mini-Nbody 2015-11-10

Test: Original

OpenBenchmarking.org metrics for this test profile configuration based on 199 public results since 7 December 2018 with the latest data as of 7 November 2022.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component
Percentile Rank
# Compatible Public Results
(NBody^2)/s (Average)
Mid-Tier
75th
< 389
Median
50th
128
OpenBenchmarking.orgDistribution Of Public Results - Test: Original199 Results Range From 3 To 115325 (NBody^2)/s323104617692492311153813845161521845920766230732538027687299943230134608369153922241529438364614348450507575306455371576785998562292645996690669213715207382776134784418074883055853628766989976922839459096897992041015111038181061251084321107391130461153534080120160200

Based on OpenBenchmarking.org data, the selected test / test configuration (CUDA Mini-Nbody 2015-11-10 - Test: Original) has an average run-time of 5 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

OpenBenchmarking.orgMinutesTime Required To Complete BenchmarkTest: OriginalRun-Time612182430Min: 1 / Avg: 4.81 / Max: 27

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 0.4%.

OpenBenchmarking.orgPercent, Fewer Is BetterAverage Deviation Between RunsTest: OriginalDeviation246810Min: 0 / Avg: 0.43 / Max: 3

Tested CPU Architectures

This benchmark has been successfully tested on the below mentioned architectures. The CPU architectures listed is where successful OpenBenchmarking.org result uploads occurred, namely for helping to determine if a given test is compatible with various alternative CPU architectures.

CPU Architecture
Kernel Identifier
Verified On
Intel / AMD x86 64-bit
x86_64
(Many Processors)
ARMv8 64-bit
aarch64
ARMv8 Cortex-A57 4-Core, ARMv8 Cortex-A57 6-Core, ARMv8 rev 0 2-Core, ARMv8 rev 0 4-Core, ARMv8 rev 0 6-Core, ARMv8 rev 0 8-Core, ARMv8 rev 1 2-Core, ARMv8 rev 1 4-Core, ARMv8 rev 3 4-Core

Recent Test Results

OpenBenchmarking.org Results Compare

1 System - 4 Benchmark Results

Intel Core i7-7820X - MSI X299 SLI PLUS - Intel Sky Lake-E DMI3 Registers

Ubuntu 22.04 - 5.15.0-52-generic - KDE Plasma 5.24.6

2 Systems - 7 Benchmark Results

AMD Ryzen 7 5700G - ASUS ROG STRIX B550-I GAMING - AMD Renoir

ManjaroLinux 22.0.0 - 5.19.16-2-MANJARO - X Server 1.21.1.4

1 System - 5 Benchmark Results

AMD Ryzen 7 5700G - ASUS ROG STRIX B550-I GAMING - AMD Renoir

ManjaroLinux 22.0.0 - 5.19.16-2-MANJARO - X Server 1.21.1.4

16 Systems - 46 Benchmark Results

ARMv8 Cortex-A78E - EDK II 1.0-d7fb19b - 30GB

Ubuntu 20.04 - 5.10.104-tegra - GNOME Shell 3.36.9

15 Systems - 46 Benchmark Results

ARMv8 Cortex-A78E - EDK II 1.0-d7fb19b - 30GB

Ubuntu 20.04 - 5.10.104-tegra - GNOME Shell 3.36.9

14 Systems - 46 Benchmark Results

ARMv8 Cortex-A78E - EDK II 1.0-d7fb19b - 30GB

Ubuntu 20.04 - 5.10.104-tegra - GNOME Shell 3.36.9

13 Systems - 42 Benchmark Results

ARMv8 Cortex-A78E - EDK II 1.0-d7fb19b - 30GB

Ubuntu 20.04 - 5.10.104-tegra - GNOME Shell 3.36.9

12 Systems - 35 Benchmark Results

ARMv8 Cortex-A78E - EDK II 1.0-d7fb19b - 30GB

Ubuntu 20.04 - 5.10.104-tegra - GNOME Shell 3.36.9

9 Systems - 31 Benchmark Results

ARMv8 Cortex-A78E - EDK II 1.0-d7fb19b - 30GB

Ubuntu 20.04 - 5.10.104-tegra - GNOME Shell 3.36.9

8 Systems - 31 Benchmark Results

ARMv8 Cortex-A78E - EDK II 1.0-d7fb19b - 30GB

Ubuntu 20.04 - 5.10.104-tegra - GNOME Shell 3.36.9

7 Systems - 31 Benchmark Results

ARMv8 Cortex-A78E - EDK II 1.0-d7fb19b - 30GB

Ubuntu 20.04 - 5.10.104-tegra - GNOME Shell 3.36.9

6 Systems - 31 Benchmark Results

ARMv8 Cortex-A78E - EDK II 1.0-d7fb19b - 30GB

Ubuntu 20.04 - 5.10.104-tegra - GNOME Shell 3.36.9

5 Systems - 23 Benchmark Results

ARMv8 Cortex-A78E - EDK II 1.0-d7fb19b - 30GB

Ubuntu 20.04 - 5.10.104-tegra - GNOME Shell 3.36.9

4 Systems - 23 Benchmark Results

ARMv8 Cortex-A78E - EDK II 1.0-d7fb19b - 30GB

Ubuntu 20.04 - 5.10.104-tegra - GNOME Shell 3.36.9

1 System - 5 Benchmark Results

ARMv8 rev 0 - NVIDIA Jetson Xavier NX Developer Kit - 8GB

Ubuntu 18.04 - 4.9.253-tegra - Unity 7.5.0

Most Popular Test Results

OpenBenchmarking.org Results Compare

18 Systems - 36 Benchmark Results

ARMv8 rev 0 - Jetson-AGX - 16GB

Ubuntu 18.04 - 4.9.140-tegra - Unity 7.5.0

17 Systems - 35 Benchmark Results

ARMv8 rev 0 - Jetson-AGX - 16GB

Ubuntu 18.04 - 4.9.140-tegra - Unity 7.5.0

16 Systems - 35 Benchmark Results

ARMv8 rev 0 - Jetson-AGX - 16GB

Ubuntu 18.04 - 4.9.140-tegra - Unity 7.5.0

10 Systems - 59 Benchmark Results

Intel Core i9-9900K - ASUS PRIME Z390-A - Intel Cannon Lake PCH Shared SRAM

Ubuntu 18.04 - 4.19.5-041905-generic - GNOME Shell 3.28.3

1 System - 5 Benchmark Results

Intel Core i7-6850K - ASUS X99-E WS/USB 3.1 - Intel Xeon E7 v4

Ubuntu 20.04 - 5.4.0-53-generic - GNOME Shell 3.36.4

1 System - 5 Benchmark Results

Intel Xeon D-2146NT - Supermicro X11SDW-8C-TP13F v1.02 - Intel Sky Lake-E DMI3 Registers

Ubuntu 18.04 - 4.18.0-15-generic - GNOME Shell 3.28.3

8 Systems - 76 Benchmark Results

ARMv7 rev 3 - ODROID-XU4 Hardkernel Odroid XU4 - 2048MB

Ubuntu 18.04 - 4.14.37-135 - X Server 1.19.6

13 Systems - 35 Benchmark Results

ARMv8 rev 0 - Jetson-AGX - 16GB

Ubuntu 18.04 - 4.9.140-tegra - Unity 7.5.0

11 Systems - 33 Benchmark Results

ARMv8 rev 0 - Jetson-AGX - 16GB

Ubuntu 18.04 - 4.9.140-tegra - Unity 7.5.0

2 Systems - 14 Benchmark Results

ARMv8 Cortex-A57 - NVIDIA Jetson Nano Developer Kit - 4096MB

Ubuntu 18.04 - 4.9.140-tegra - Unity 7.5.0

11 Systems - 59 Benchmark Results

Intel Core i9-9900K - ASUS PRIME Z390-A - Intel Cannon Lake PCH Shared SRAM

Ubuntu 18.04 - 4.19.5-041905-generic - GNOME Shell 3.28.3

9 Systems - 77 Benchmark Results

ARMv8 Cortex-A73 - Hardkernel ODROID-N2 - 4096MB

Ubuntu 18.04 - 4.9.156-14 - GCC 7.3.0

12 Systems - 59 Benchmark Results

Intel Core i9-9900K - EVOC P7xxTM1 powered by premamod - Intel 8th Gen Core 8-core Desktop

Ubuntu 18.04 - 5.2.10 - GNOME Shell 3.28.4

5 Systems - 71 Benchmark Results

ARMv8 rev 1 - NVIDIA Jetson Nano Developer Kit - 4096MB

Ubuntu 18.04 - 4.9.140-tegra - Unity 7.5.0

Find More Test Results