CUDA Mini-Nbody

The CUDA version of Harrism's mini-nbody tests.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark cuda-mini-nbody.

Project Site

github.com

Test Created

10 November 2015

Last Updated

15 March 2019

Test Maintainer

Michael Larabel 

Test Type

Graphics

Average Install Time

2 Seconds

Average Run Time

39 Seconds

Test Dependencies

C/C++ Compiler Toolchain

Accolades

100k+ Downloads

Supported Platforms


Public Result Uploads *Reported Installs **Reported Test Completions **Test Profile Page Views ***OpenBenchmarking.orgEventsCUDA Mini-Nbody Popularity Statisticspts/cuda-mini-nbody2015.112016.022016.052016.082016.112017.022017.052017.082017.112018.022018.052018.082018.112019.022019.052019.082019.112020.022020.052020.082020.112021.022021.052021.082021.112022.022022.052022.082022.112023.022023.052023.082023.112024.0220K40K60K80K100K
* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
*** Test profile page view reporting began March 2021.
Data updated weekly as of 18 March 2024.
Cache Blocking18.7%Flush Denormals To Zero18.3%SOA Data Layout18.4%Original25.7%Loop Unrolling18.9%Test Option PopularityOpenBenchmarking.org

Revision History

pts/cuda-mini-nbody-1.1.1   [View Source]   Fri, 15 Mar 2019 16:35:43 GMT
Fix MIB to HIB proportion.

pts/cuda-mini-nbody-1.1.0   [View Source]   Fri, 07 Dec 2018 14:52:29 GMT
Update https://github.com/phoronix-test-suite/test-profiles/pull/22

pts/cuda-mini-nbody-1.0.1   [View Source]   Sat, 11 Jun 2016 12:14:06 GMT
Set /usr/local/cuda/bin in PATH when needed

pts/cuda-mini-nbody-1.0.0   [View Source]   Tue, 10 Nov 2015 13:41:29 GMT
CUDA mini Nbody


Performance Metrics

Analyze Test Configuration:

CUDA Mini-Nbody 2015-11-10

Test: Original

OpenBenchmarking.org metrics for this test profile configuration based on 231 public results since 7 December 2018 with the latest data as of 6 March 2024.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component
Percentile Rank
# Compatible Public Results
(NBody^2)/s (Average)
Mid-Tier
75th
< 307
Median
50th
97
OpenBenchmarking.orgDistribution Of Public Results - Test: Original231 Results Range From 3 To 115325 (NBody^2)/s3231046176924923111538138451615218459207662307325380276872999432301346083691539222415294383646143484505075753064553715767859985622926459966906692137152073827761347844180748830558536287669899769228394590968979920410151110381810612510843211073911304611535350100150200250

Based on OpenBenchmarking.org data, the selected test / test configuration (CUDA Mini-Nbody 2015-11-10 - Test: Original) has an average run-time of 5 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

OpenBenchmarking.orgMinutesTime Required To Complete BenchmarkTest: OriginalRun-Time612182430Min: 1 / Avg: 4.84 / Max: 27

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 0.4%.

OpenBenchmarking.orgPercent, Fewer Is BetterAverage Deviation Between RunsTest: OriginalDeviation246810Min: 0 / Avg: 0.39 / Max: 3

Tested CPU Architectures

This benchmark has been successfully tested on the below mentioned architectures. The CPU architectures listed is where successful OpenBenchmarking.org result uploads occurred, namely for helping to determine if a given test is compatible with various alternative CPU architectures.

CPU Architecture
Kernel Identifier
Verified On
Intel / AMD x86 64-bit
x86_64
(Many Processors)
ARMv8 64-bit
aarch64
ARMv8 Cortex-A57 4-Core, ARMv8 Cortex-A57 6-Core, ARMv8 Cortex-A78E 12-Core, ARMv8 Cortex-A78E 6-Core, ARMv8 Cortex-A78E 8-Core, ARMv8 rev 0 2-Core, ARMv8 rev 0 4-Core, ARMv8 rev 0 6-Core, ARMv8 rev 0 8-Core, ARMv8 rev 1 2-Core, ARMv8 rev 1 4-Core, ARMv8 rev 3 4-Core

Recent Test Results

OpenBenchmarking.org Results Compare

1 System - 3 Benchmark Results

AMD Ryzen 9 7950X 16-Core - ASUS ProArt X670E-CREATOR WIFI - AMD Device 14d8

Pop 22.04 - 6.6.10-76060610-generic - GNOME Shell 42.5

1 System - 5 Benchmark Results

AMD Ryzen 9 7950X 16-Core - ASUS ProArt X670E-CREATOR WIFI - AMD Device 14d8

Pop 22.04 - 6.6.10-76060610-generic - GNOME Shell 42.5

1 System - 2 Benchmark Results

AMD Ryzen 9 7950X 16-Core - ASUS ProArt X670E-CREATOR WIFI - AMD Device 14d8

Pop 22.04 - 6.6.10-76060610-generic - GNOME Shell 42.5

1 System - 3 Benchmark Results

AMD Ryzen 9 7950X 16-Core - ASUS ProArt X670E-CREATOR WIFI - AMD Device 14d8

Pop 22.04 - 6.6.10-76060610-generic - GNOME Shell 42.5

1 System - 5 Benchmark Results

AMD Ryzen 9 7950X 16-Core - ASUS ProArt X670E-CREATOR WIFI - AMD Device 14d8

Pop 22.04 - 6.6.10-76060610-generic - GNOME Shell 42.5

8 Systems - 33 Benchmark Results

ARMv8 Cortex-A78E - EDK II r35.3.1-63e71c9-dirty - 30GB

Ubuntu 20.04 - 5.10.104-tegra - GNOME Shell 3.36.9

7 Systems - 33 Benchmark Results

ARMv8 Cortex-A78E - EDK II r35.3.1-63e71c9-dirty - 30GB

Ubuntu 20.04 - 5.10.104-tegra - GNOME Shell 3.36.9

7 Systems - 29 Benchmark Results

ARMv8 Cortex-A78E - EDK II r35.3.1-63e71c9-dirty - 30GB

Ubuntu 20.04 - 5.10.104-tegra - GNOME Shell 3.36.9

6 Systems - 29 Benchmark Results

ARMv8 Cortex-A78E - EDK II r35.3.1-63e71c9-dirty - 30GB

Ubuntu 20.04 - 5.10.104-tegra - GNOME Shell 3.36.9

5 Systems - 29 Benchmark Results

ARMv8 Cortex-A78E - EDK II r35.3.1-63e71c9-dirty - 30GB

Ubuntu 20.04 - 5.10.104-tegra - GNOME Shell 3.36.9

4 Systems - 25 Benchmark Results

ARMv8 Cortex-A78E - EDK II r35.3.1-63e71c9-dirty - 30GB

Ubuntu 20.04 - 5.10.104-tegra - GNOME Shell 3.36.9

3 Systems - 17 Benchmark Results

ARMv8 Cortex-A78E - EDK II r35.3.1-63e71c9-dirty - 30GB

Ubuntu 20.04 - 5.10.104-tegra - GNOME Shell 3.36.9

1 System - 5 Benchmark Results

Intel Core i7-13700E - (2.04 BIOS) - Intel Device 7aa7

Ubuntu 22.04 - 6.5.0-21-generic - GNOME Shell 42.9

1 System - 5 Benchmark Results

Intel Core i9-13900K - (2.04 BIOS) - Intel Device 7aa7

Ubuntu 22.04 - 6.5.0-18-generic - GNOME Shell 42.9

4 Systems - 21 Benchmark Results

ARMv8 Cortex-A78E - EDK II 4.1-33958178 - 8GB

Ubuntu 20.04 - 5.10.120-tegra - GNOME Shell 3.36.9

Most Popular Test Results

OpenBenchmarking.org Results Compare

8 Systems - 76 Benchmark Results

ARMv7 rev 1 - Rockchip - 2048MB

Debian 9.0 - 4.4.16-00006-g4431f98-dirty - LXDE

10 Systems - 59 Benchmark Results

Intel Core i9-9900K - ASUS PRIME Z390-A - Intel Cannon Lake PCH Shared SRAM

Ubuntu 18.04 - 4.19.5-041905-generic - GNOME Shell 3.28.3

18 Systems - 36 Benchmark Results

ARMv8 rev 0 - Jetson-AGX - 16GB

Ubuntu 18.04 - 4.9.140-tegra - Unity 7.5.0

17 Systems - 35 Benchmark Results

ARMv8 rev 0 - Jetson-AGX - 16GB

Ubuntu 18.04 - 4.9.140-tegra - Unity 7.5.0

16 Systems - 35 Benchmark Results

ARMv8 rev 0 - Jetson-AGX - 16GB

Ubuntu 18.04 - 4.9.140-tegra - Unity 7.5.0

2 Systems - 14 Benchmark Results

ARMv8 Cortex-A57 - NVIDIA Jetson Nano Developer Kit - 4096MB

Ubuntu 18.04 - 4.9.140-tegra - Unity 7.5.0

1 System - 5 Benchmark Results

Intel Core i7-6850K - ASUS X99-E WS/USB 3.1 - Intel Xeon E7 v4

Ubuntu 20.04 - 5.4.0-53-generic - GNOME Shell 3.36.4

1 System - 5 Benchmark Results

Intel Xeon D-2146NT - Supermicro X11SDW-8C-TP13F v1.02 - Intel Sky Lake-E DMI3 Registers

Ubuntu 18.04 - 4.18.0-15-generic - GNOME Shell 3.28.3

10 Systems - 77 Benchmark Results

Amlogic ARMv8 Cortex-A53 - ODROID-C2 - 2048MB

Ubuntu 18.04 - 3.16.57-20 - X Server 1.19.6

13 Systems - 35 Benchmark Results

ARMv8 rev 0 - Jetson-AGX - 16GB

Ubuntu 18.04 - 4.9.140-tegra - Unity 7.5.0

9 Systems - 77 Benchmark Results

ARMv7 rev 1 - Rockchip - 2048MB

Debian 9.0 - 4.4.16-00006-g4431f98-dirty - LXDE

12 Systems - 59 Benchmark Results

Intel Core i9-9900K - ASUS PRIME Z390-A - Intel Cannon Lake PCH Shared SRAM

Ubuntu 18.04 - 4.19.5-041905-generic - GNOME Shell 3.28.3

11 Systems - 33 Benchmark Results

ARMv8 rev 0 - Jetson-AGX - 16GB

Ubuntu 18.04 - 4.9.140-tegra - Unity 7.5.0

11 Systems - 59 Benchmark Results

Intel Core i9-9900K - ASUS PRIME Z390-A - Intel Cannon Lake PCH Shared SRAM

Ubuntu 18.04 - 4.19.5-041905-generic - GNOME Shell 3.28.3

Find More Test Results