Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark rodinia.

Project Site

cs.virginia.edu

Test Created

23 January 2013

Last Updated

28 December 2021

Test Maintainer

Michael Larabel 

Test Type

Processor

Average Install Time

1 Minute, 18 Seconds

Average Run Time

20 Minutes, 55 Seconds

Test Dependencies

C/C++ Compiler Toolchain + OpenMPI + OpenCL

Accolades

150k+ Downloads + 5k+ Public Benchmark Results

Supported Platforms


Public Result Uploads *Reported Installs **Reported Test Completions **Test Profile Page Views ***OpenBenchmarking.orgEventsRodinia Popularity Statisticspts/rodinia2013.012013.052013.092014.012014.052014.092015.012015.052015.092016.012016.052016.092017.012017.052017.092018.012018.052018.092019.012019.052019.092020.012020.052020.092021.012021.052021.092022.012022.052022.092023.012023.052023.092024.0120K40K60K80K100K
* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
*** Test profile page view reporting began March 2021.
Data updated weekly as of 25 March 2024.
OpenMP Leukocyte16.4%OpenMP CFD Solver25.1%OpenMP Streamcluster16.8%OpenMP LavaMD29.3%OpenMP HotSpot3D12.5%Test Option PopularityOpenBenchmarking.org

Revision History

pts/rodinia-1.3.2   [View Source]   Tue, 28 Dec 2021 09:31:58 GMT
Update Rodinia 3.1 download links as upstream host seems down..... Closes: [200~https://github.com/phoronix-test-suite/phoronix-test-suite/issues/583[201~

pts/rodinia-1.3.1   [View Source]   Thu, 09 Jul 2020 18:37:47 GMT
Drop OpenMP Myocyte.

pts/rodinia-1.3.0   [View Source]   Tue, 23 Jun 2020 12:01:54 GMT
Update against upstream Rodinia 3.1, increase the intensity of some of the problems.

pts/rodinia-1.2.2   [View Source]   Thu, 08 Oct 2015 15:59:24 GMT
Rodinia works on BSD.

pts/rodinia-1.2.1   [View Source]   Thu, 07 Nov 2013 09:07:01 GMT
Add opencl as external dependency.

pts/rodinia-1.2.0   [View Source]   Thu, 07 Nov 2013 07:59:47 GMT
Few more Rodinia OpenCL improvements.

pts/rodinia-1.1.1   [View Source]   Thu, 07 Nov 2013 07:35:14 GMT
Update Rodinia OpenCL Heartwall and Myocyte tests.

pts/rodinia-1.1.0   [View Source]   Thu, 07 Nov 2013 01:03:48 GMT
Update against upstream Rodinia 2.4 release. Also incorporate OpenCL tests to complement OpenMP tests.

pts/rodinia-1.0.0   [View Source]   Wed, 23 Jan 2013 16:16:40 GMT
Publish Rodinia 2.2 accelerator benchmarks.

Suites Using This Test

OpenCL

Multi-Core

Common Workstation Benchmarks

HPC - High Performance Computing

CPU Massive

Server CPU Tests

NVIDIA GPU Compute


Performance Metrics

Analyze Test Configuration:

Rodinia 2.4

Test: OpenMP LavaMD

OpenBenchmarking.org metrics for this test profile configuration based on 3,708 public results since 18 January 2014 with the latest data as of 28 March 2024.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component
Percentile Rank
# Compatible Public Results
Seconds (Average)
90th
12
15 +/- 1
Mid-Tier
75th
> 32
74th
25
34 +/- 1
72nd
4
37 +/- 1
60th
11
61 +/- 4
53rd
5
85 +/- 1
52nd
8
90 +/- 2
Median
50th
94
50th
3
95 +/- 1
49th
3
99 +/- 7
48th
37
100 +/- 1
43rd
35
114 +/- 3
42nd
7
115 +/- 11
37th
3
155 +/- 23
37th
3
158 +/- 2
34th
56
177 +/- 1
32nd
5
190 +/- 2
32nd
3
194 +/- 13
31st
12
196 +/- 28
31st
6
198 +/- 7
31st
3
200 +/- 11
27th
18
224 +/- 1
Low-Tier
25th
> 226
25th
4
231 +/- 14
23rd
3
253 +/- 20
23rd
3
254 +/- 26
21st
3
279 +/- 2
20th
3
284 +/- 13
19th
3
328 +/- 12
18th
5
336 +/- 17
18th
3
352 +/- 1
18th
5
353 +/- 1
15th
5
407 +/- 25
15th
5
411 +/- 5
14th
10
434 +/- 13
13th
3
469 +/- 66
12th
4
499 +/- 8
12th
22
501 +/- 1
11th
11
528 +/- 15
11th
3
533 +/- 12
9th
13
689 +/- 1
1st
4
2095 +/- 129
1st
5
2145 +/- 25
OpenBenchmarking.orgDistribution Of Public Results - Test: OpenMP LavaMD3708 Results Range From 3 To 26694 Seconds35371071160521392673320737414275480953435877641169457479801385479081961510149106831121711751122851281913353138871442114955154891602316557170911762518159186931922719761202952082921363218972243122965234992403324567251012563526169267037001400210028003500

Based on OpenBenchmarking.org data, the selected test / test configuration (Rodinia 2.4 - Test: OpenMP LavaMD) has an average run-time of 8 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

OpenBenchmarking.orgMinutesTime Required To Complete BenchmarkTest: OpenMP LavaMDRun-Time20406080100Min: 1 / Avg: 7.37 / Max: 111

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 0.4%.

OpenBenchmarking.orgPercent, Fewer Is BetterAverage Deviation Between RunsTest: OpenMP LavaMDDeviation246810Min: 0 / Avg: 0.44 / Max: 6

Does It Scale Well With Increasing Cores?

Yes, based on the automated analysis of the collected public benchmark data, this test / test settings does generally scale well with increasing CPU core counts. Data based on publicly available results for this test / test settings, separated by vendor, result divided by the reference CPU clock speed, grouped by matching physical CPU core count, and normalized against the smallest core count tested from each vendor for each CPU having a sufficient number of test samples and statistically significant data.

IntelAMDOpenBenchmarking.orgRelative Core Scaling To BaseRodinia CPU Core ScalingTest: OpenMP LavaMD248163248121620

Notable Instruction Set Usage

Notable instruction set extensions supported by this test, based on an automatic analysis by the Phoronix Test Suite / OpenBenchmarking.org analytics engine.

Instruction Set
Support
Instructions Detected
SSE2 (SSE2)
Used by default on supported hardware.
 
MOVAPD PUNPCKLQDQ XORPD UNPCKLPD MOVD MOVDQA SUBSD DIVSD MULSD ADDSD UCOMISD SQRTSD CVTSI2SD MOVUPD UNPCKHPD MULPD ADDPD SUBPD
Last automated analysis: 18 January 2022

This test profile binary relies on the shared libraries libm.so.6, libgomp.so.1, libc.so.6.

Tested CPU Architectures

This benchmark has been successfully tested on the below mentioned architectures. The CPU architectures listed is where successful OpenBenchmarking.org result uploads occurred, namely for helping to determine if a given test is compatible with various alternative CPU architectures.

CPU Architecture
Kernel Identifier
Verified On
Intel / AMD x86 64-bit
x86_64
(Many Processors)
SPARC64
sparc64
UltraSparc T3
IBM Z
s390x
(Many Processors)
RISC-V 64-bit
riscv64
sifive rocket0
IBM POWER (PowerPC) 64-bit
ppc64le
POWER8, POWER8E, POWER9 22-Core, POWER9 4-Core, POWER9 44-Core, POWER9 80-Core, POWER9 altivec supported 160-Core, POWER9 altivec supported 176-Core, POWER9 altivec supported 44-Core, POWER9 altivec supported 64-Core
IBM POWER (PowerPC) 64-bit
ppc64
POWER7, POWER9 4-Core, PPC970 altivec supported, PPC970MP
MIPS 64-bit
mips64
Loongson-3A R4
Intel / AMD x86 32-bit
i686
(Many Processors)
ARMv7 32-bit
armv7l
ARMv7 4-Core, ARMv7 Cortex-A72 4-Core, ARMv7 rev 1 4-Core, ARMv7 rev 2 4-Core, ARMv7 rev 3 2-Core, ARMv7 rev 3 4-Core, ARMv7 rev 3 8-Core
ARMv8 64-bit
aarch64
AArch64 rev 0, ARMv8 16-Core, ARMv8 2-Core, ARMv8 32-Core, ARMv8 64-Core, ARMv8 8-Core, ARMv8 Cortex-A53 4-Core, ARMv8 Cortex-A72, ARMv8 Cortex-A72 16-Core, ARMv8 Cortex-A72 2-Core, ARMv8 Cortex-A72 4-Core, ARMv8 Cortex-A72 6-Core, ARMv8 Cortex-A72 8-Core, ARMv8 Cortex-A73, ARMv8 Cortex-A73 6-Core, ARMv8 Neoverse-N1, ARMv8 Neoverse-N1 16-Core, ARMv8 Neoverse-N1 4-Core, ARMv8 Neoverse-N1 64-Core, ARMv8 Neoverse-N1 8-Core, ARMv8 rev 0 8-Core, ARMv8 rev 1 4-Core, ARMv8 rev 3 6-Core, Amlogic ARMv8 Cortex-A53 4-Core, Ampere Altra ARMv8 Neoverse-N1 160-Core, Ampere eMAG ARMv8 16-Core, Ampere eMAG ARMv8 32-Core, Ampere eMAG ARMv8 4-Core, Ampere eMAG ARMv8 8-Core, Cavium ThunderX, HUAWEI Kunpeng 920 2249K, Rockchip ARMv8 Cortex-A72 6-Core

Recent Test Results

OpenBenchmarking.org Results Compare

83 Systems - 23 Benchmark Results

AMD Athlon II X3 425 - ASRock 890FX Deluxe3 - AMD RD890 PCI-e GFX Hydra part

Ubuntu 18.04 - 4.17.0-999-generic - GCC 8.1.0

83 Systems - 23 Benchmark Results

Intel Celeron N3350 - APL ASAHI_AP - Intel Celeron N3350

Debian 11 - 5.10.0-9-amd64 - MATE 1.24.1

44 Systems - 69 Benchmark Results

AMD Ryzen 5 2600 Six-Core - Gigabyte B450 AORUS M - AMD 17h

Fedora 31 - 5.5.17-200.fc31.x86_64 - KDE Plasma 5.17.5

19 Systems - 35 Benchmark Results

4 x AMD EPYC (with IBPB) - QEMU Standard PC - Intel 440FX- 82441FX PMC

Ubuntu 18.04 - 5.3.0-51-generic - ext4

18 Systems - 35 Benchmark Results

4 x AMD EPYC - QEMU Standard PC - Intel 440FX- 82441FX PMC

Ubuntu 18.04 - 4.15.0-88-generic - ext4

17 Systems - 35 Benchmark Results

4 x QEMU Virtual 2.5+ - QEMU Standard PC - Intel 440FX- 82441FX PMC

Ubuntu 18.04 - 4.15.0-88-generic - ext4

15 Systems - 35 Benchmark Results

4 x AMD EPYC (with IBPB) - QEMU Standard PC - Intel 440FX- 82441FX PMC

Ubuntu 18.04 - 4.15.0-88-generic - ext4

16 Systems - 35 Benchmark Results

4 x QEMU Virtual 2.5+ - QEMU Standard PC - Intel 440FX- 82441FX PMC

Ubuntu 18.04 - 5.3.0-51-generic - ext4

15 Systems - 35 Benchmark Results

4 x AMD EPYC (with IBPB) - QEMU Standard PC - Intel 440FX- 82441FX PMC

Ubuntu 18.04 - 4.15.0-88-generic - ext4

Find More Test Results