CUDA Mini-Nbody

The CUDA version of Harrism's mini-nbody tests.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark cuda-mini-nbody.

Project Site

github.com

Test Created

10 November 2015

Last Updated

15 March 2019

Test Maintainer

Michael Larabel 

Test Type

Graphics

Average Install Time

1 Second

Average Run Time

19 Minutes, 32 Seconds

Test Dependencies

C/C++ Compiler Toolchain

Accolades

90k+ Downloads

Supported Platforms


Public Result Uploads *Reported Installs **Reported Test Completions **Test Profile Page Views ***OpenBenchmarking.orgEventsCUDA Mini-Nbody Popularity Statisticspts/cuda-mini-nbody2015.112016.012016.032016.052016.072016.092016.112017.012017.032017.052017.072017.092017.112018.012018.032018.052018.072018.092018.112019.012019.032019.052019.072019.092019.112020.012020.032020.052020.072020.092020.112021.012021.032021.052021.072021.0920K40K60K80K100K
* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
*** Test profile page view reporting began March 2021.
Data current as of 27 October 2021.
Cache Blocking18.2%Flush Denormals To Zero18.1%SOA Data Layout18.1%Original26.5%Loop Unrolling19.0%Test Option PopularityOpenBenchmarking.org

Revision History

pts/cuda-mini-nbody-1.1.1   [View Source]   Fri, 15 Mar 2019 16:35:43 GMT
Fix MIB to HIB proportion.

pts/cuda-mini-nbody-1.1.0   [View Source]   Fri, 07 Dec 2018 14:52:29 GMT
Update https://github.com/phoronix-test-suite/test-profiles/pull/22

pts/cuda-mini-nbody-1.0.1   [View Source]   Sat, 11 Jun 2016 12:14:06 GMT
Set /usr/local/cuda/bin in PATH when needed

pts/cuda-mini-nbody-1.0.0   [View Source]   Tue, 10 Nov 2015 13:41:29 GMT
CUDA mini Nbody


Performance Metrics

Analyze Test Configuration:

CUDA Mini-Nbody 2015-11-10

Test: Original

OpenBenchmarking.org metrics for this test profile configuration based on 3,278 public results since 7 December 2018 with the latest data as of 9 September 2021.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component
Percentile Rank
# Compatible Public Results
(NBody^2)/s (Average)
98th
109
26113
Mid-Tier
75th
< 445
Median
50th
242
OpenBenchmarking.orgDistribution Of Public Results - Test: Original3278 Results Range From 3 To 93114 (NBody^2)/s3186637295592745593181118113044149071677018633204962235924222260852794829811316743353735400372633912640989428524471546578484415030452167540305589357756596196148263345652086707168934707977266074523763867824980112819758383885701875648942791290931535001000150020002500

Based on OpenBenchmarking.org data, the selected test / test configuration (CUDA Mini-Nbody 2015-11-10 - Test: Original) has an average run-time of 11 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

OpenBenchmarking.orgMinutesTime Required To Complete BenchmarkTest: OriginalRun-Time1224364860Min: 1 / Avg: 10.74 / Max: 60

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 0.5%.

OpenBenchmarking.orgPercent, Fewer Is BetterAverage Deviation Between RunsTest: OriginalDeviation246810Min: 0 / Avg: 0.5 / Max: 4

Recent Test Results

OpenBenchmarking.org Results Compare

1 System - 1 Benchmark Result

Phytium S2500/64 C00 - Default - 16 x 16384 MB DDR4-3200MT

Kylin Linux Advanced Server V10 - 4.19.90-23.6.v2101.ky10.aarch64 - X Server 1.20.8

3 Systems - 6 Benchmark Results

AMD Ryzen Threadripper 2990WX 32-Core - ASUS PRIME X399-A - AMD 17h

slackware 14.2+ - 5.14.2 - KDE Plasma 5.22.5

18 Systems - 39 Benchmark Results

ARMv8 Cortex-A57 - NVIDIA Jetson Nano Developer Kit - 4096MB

Ubuntu 18.04 - 4.9.201-tegra - Unity 7.5.0

17 Systems - 39 Benchmark Results

ARMv8 Cortex-A57 - NVIDIA Jetson Nano Developer Kit - 4096MB

Ubuntu 18.04 - 4.9.201-tegra - Unity 7.5.0

15 Systems - 37 Benchmark Results

ARMv8 Cortex-A57 - NVIDIA Jetson Nano Developer Kit - 4096MB

Ubuntu 18.04 - 4.9.201-tegra - Unity 7.5.0

14 Systems - 37 Benchmark Results

ARMv8 Cortex-A57 - NVIDIA Jetson Nano Developer Kit - 4096MB

Ubuntu 18.04 - 4.9.201-tegra - Unity 7.5.0

13 Systems - 37 Benchmark Results

ARMv8 Cortex-A57 - NVIDIA Jetson Nano Developer Kit - 4096MB

Ubuntu 18.04 - 4.9.201-tegra - Unity 7.5.0

12 Systems - 37 Benchmark Results

ARMv8 Cortex-A57 - NVIDIA Jetson Nano Developer Kit - 4096MB

Ubuntu 18.04 - 4.9.201-tegra - Unity 7.5.0

11 Systems - 37 Benchmark Results

ARMv8 Cortex-A57 - NVIDIA Jetson Nano Developer Kit - 4096MB

Ubuntu 18.04 - 4.9.201-tegra - Unity 7.5.0

10 Systems - 37 Benchmark Results

ARMv8 Cortex-A57 - NVIDIA Jetson Nano Developer Kit - 4096MB

Ubuntu 18.04 - 4.9.201-tegra - Unity 7.5.0

9 Systems - 37 Benchmark Results

ARMv8 Cortex-A57 - NVIDIA Jetson Nano Developer Kit - 4096MB

Ubuntu 18.04 - 4.9.201-tegra - Unity 7.5.0

11 Systems - 31 Benchmark Results

ARMv8 Cortex-A57 - NVIDIA Jetson Nano Developer Kit - 4096MB

Ubuntu 18.04 - 4.9.201-tegra - Unity 7.5.0

10 Systems - 31 Benchmark Results

ARMv8 Cortex-A57 - NVIDIA Jetson Nano Developer Kit - 4096MB

Ubuntu 18.04 - 4.9.201-tegra - Unity 7.5.0

9 Systems - 31 Benchmark Results

ARMv8 Cortex-A57 - NVIDIA Jetson Nano Developer Kit - 4096MB

Ubuntu 18.04 - 4.9.201-tegra - Unity 7.5.0

8 Systems - 31 Benchmark Results

ARMv8 Cortex-A57 - NVIDIA Jetson Nano Developer Kit - 4096MB

Ubuntu 18.04 - 4.9.201-tegra - Unity 7.5.0

Most Popular Test Results

OpenBenchmarking.org Results Compare

18 Systems - 36 Benchmark Results

ARMv8 rev 0 - Jetson-AGX - 16GB

Ubuntu 18.04 - 4.9.140-tegra - Unity 7.5.0

17 Systems - 35 Benchmark Results

ARMv8 rev 0 - Jetson-AGX - 16GB

Ubuntu 18.04 - 4.9.140-tegra - Unity 7.5.0

16 Systems - 35 Benchmark Results

ARMv8 rev 0 - Jetson-AGX - 16GB

Ubuntu 18.04 - 4.9.140-tegra - Unity 7.5.0

10 Systems - 59 Benchmark Results

Intel Core i9-9900K - ASUS PRIME Z390-A - Intel Cannon Lake PCH Shared SRAM

Ubuntu 18.04 - 4.19.5-041905-generic - GNOME Shell 3.28.3

13 Systems - 35 Benchmark Results

ARMv8 rev 0 - Jetson-AGX - 16GB

Ubuntu 18.04 - 4.9.140-tegra - Unity 7.5.0

11 Systems - 33 Benchmark Results

ARMv8 rev 0 - Jetson-AGX - 16GB

Ubuntu 18.04 - 4.9.140-tegra - Unity 7.5.0

8 Systems - 76 Benchmark Results

ARMv8 rev 3 - quill - 8192MB

Ubuntu 16.04 - 4.4.38-tegra - Unity 7.4.0

1 System - 5 Benchmark Results

Intel Core i7-6850K - ASUS X99-E WS/USB 3.1 - Intel Xeon E7 v4

Ubuntu 20.04 - 5.4.0-53-generic - GNOME Shell 3.36.4

1 System - 5 Benchmark Results

Intel Xeon D-2146NT - Supermicro X11SDW-8C-TP13F v1.02 - Intel Sky Lake-E DMI3 Registers

Ubuntu 18.04 - 4.18.0-15-generic - GNOME Shell 3.28.3

11 Systems - 59 Benchmark Results

Intel Core i9-9900K - ASUS PRIME Z390-A - Intel Cannon Lake PCH Shared SRAM

Ubuntu 18.04 - 4.19.5-041905-generic - GNOME Shell 3.28.3

2 Systems - 14 Benchmark Results

ARMv8 Cortex-A57 - NVIDIA Jetson Nano Developer Kit - 4096MB

Ubuntu 18.04 - 4.9.140-tegra - Unity 7.5.0

15 Systems - 35 Benchmark Results

ARMv8 rev 0 - Jetson-AGX - 16GB

Ubuntu 18.04 - 4.9.140-tegra - Unity 7.5.0

9 Systems - 77 Benchmark Results

ARMv7 rev 4 - BCM2835 Raspberry Pi 3 Model B Plus Rev 1.3 - 926MB

Raspbian 9.6 - 4.19.23-v7+ - LXDE

5 Systems - 25 Benchmark Results

ARMv8 rev 0 - Jetson-AGX - 16GB

Ubuntu 18.04 - 4.9.140-tegra - Unity 7.5.0

Find More Test Results