Open FMM Nero2D

This is a test of Nero2D, which is a two-dimensional TM/TE solver for Open FMM. Open FMM is a free collection of electromagnetic software for scattering at very large objects. This test profile times how long it takes to solve one of the included 2D examples.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark nero2d.

Test Created

6 December 2010

Last Updated

7 February 2018

Test Maintainer

Michael Larabel 

Test Type

Processor

Average Install Time

16 Seconds

Average Run Time

17 Minutes, 46 Seconds

Test Dependencies

C/C++ Compiler Toolchain + Fortran + FFTW + BLAS (Basic Linear Algebra Sub-Routine) + LAPACK (Linear Algebra Pack) + OpenMPI

Accolades

150k+ Downloads

Supported Platforms


Public Result Uploads *Reported Test Completions **Reported Installs **Test Profile Page Views ***OpenBenchmarking.orgEventsOpen FMM Nero2D Popularity Statisticspts/nero2d2010.122011.052011.102012.062012.112013.042013.092014.022014.072014.122015.052015.102016.032016.082017.012017.062017.112018.042018.092019.022019.072019.122020.052020.102021.032021.082022.012022.062022.112023.042023.092024.022024.072K4K6K8K10K
* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
*** Test profile page view reporting began March 2021.
Data updated weekly as of 23 November 2024.

Revision History

pts/nero2d-1.1.0   [View Source]   Wed, 07 Feb 2018 09:56:31 GMT
Run against modified sources that build with modern compilers, switch to MPI version.

pts/nero2d-1.0.1   [View Source]   Mon, 12 Mar 2012 15:45:10 GMT
Upstream OpenFMM / Nero2D is dead and all files removed, so host Nero2D source package on PTS mirror. Updated downloads.xml to r

pts/nero2d-1.0.0   [View Source]   Mon, 06 Dec 2010 14:58:14 GMT
Initial import into OpenBenchmarking.org

Suites Using This Test

Multi-Core

CPU Massive

MPI Benchmarks


Performance Metrics

Analyze Test Configuration:

Open FMM Nero2D 2.0.2

Total Time

OpenBenchmarking.org metrics for this test profile configuration based on 280 public results since 7 February 2018 with the latest data as of 13 July 2021.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component
Percentile Rank
# Compatible Public Results
Seconds (Average)
Mid-Tier
75th
> 53
63rd
5
74 +/- 3
Median
50th
93
50th
4
93 +/- 5
49th
3
95 +/- 4
36th
6
144 +/- 17
Low-Tier
25th
> 226
16th
5
417 +/- 8
12th
3
480 +/- 2
OpenBenchmarking.orgDistribution Of Public Results - Total Time280 Results Range From 15 To 42008 Seconds15855169525353375421550555895673575758415925510095109351177512615134551429515135159751681517655184951933520175210152185522695235352437525215260552689527735285752941530255310953193532775336153445535295361353697537815386553949540335411754201560120180240300

Based on OpenBenchmarking.org data, the selected test / test configuration (Open FMM Nero2D 2.0.2 - Total Time) has an average run-time of 10 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

OpenBenchmarking.orgMinutesTime Required To Complete BenchmarkTotal TimeRun-Time20406080100Min: 1 / Avg: 10.04 / Max: 128

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 0.7%.

OpenBenchmarking.orgPercent, Fewer Is BetterAverage Deviation Between RunsTotal TimeDeviation3691215Min: 0 / Avg: 0.7 / Max: 7

Notable Instruction Set Usage

Notable instruction set extensions supported by this test, based on an automatic analysis by the Phoronix Test Suite / OpenBenchmarking.org analytics engine.

Instruction Set
Support
Instructions Detected
SSE2 (SSE2)
Used by default on supported hardware.
 
XORPD SUBSD ADDSD MULSD MOVAPD UCOMISD MOVDQU CVTSI2SD DIVSD COMISD ANDPD MOVDQA CVTTSD2SI MAXSD MINSD ANDNPD CMPNLESD ORPD SQRTSD CMPNLTSD CVTSD2SS CVTSS2SD CMPLESD
Requires passing a supported compiler/build flag (verified with targets: sandybridge, skylake, tigerlake, cascadelake, sapphirerapids, alderlake, znver2, znver3).
Found on Intel processors since Sandy Bridge (2011).
Found on AMD processors since Bulldozer (2011).

 
VZEROUPPER VINSERTF128 VPERM2F128 VEXTRACTF128 VPERMILPD VBROADCASTSS VBROADCASTSD
Requires passing a supported compiler/build flag (verified with targets: skylake, tigerlake, cascadelake, sapphirerapids, alderlake, znver2, znver3).
Found on Intel processors since Haswell (2013).
Found on AMD processors since Excavator (2016).

 
VPERMPD VPERMQ VPBROADCASTD VEXTRACTI128 VPSLLVD VPBROADCASTQ VINSERTI128 VPBROADCASTW VPERM2I128 VPERMD
FMA (FMA)
Requires passing a supported compiler/build flag (verified with targets: skylake, tigerlake, cascadelake, sapphirerapids, alderlake, znver2, znver3).
Found on Intel processors since Haswell (2013).
Found on AMD processors since Bulldozer (2011).

 
VFMADD132PD VFMSUB132PD VFMSUB132SD VFMADD132SD VFMADD231SD VFMSUB231SD VFNMADD231SD VFNMADD132PD VFMADD213SD VFNMSUB132SD VFNMADD132SD VFMSUB231PD VFNMSUB231SD VFNMADD231PD VFMADD213PD VFNMADD213SD VFMADD231PD VFMSUB213SD VFNMSUB213SD VFNMADD132SS VFMADD132SS
Advanced Vector Extensions 512 (AVX512)
Requires passing a supported compiler/build flag (verified with targets: cascadelake, sapphirerapids).
 
(ZMM REGISTER USE)
The test / benchmark does honor compiler flag changes.
Last automated analysis: 18 January 2022

This test profile binary relies on the shared libraries libfftw3.so.3, liblapack.so.3, libf77blas.so.3, libgfortran.so.5, libmpi.so.40, libm.so.6, libc.so.6, libopenblas.so.0, libatlas.so.3, libquadmath.so.0, libopen-pal.so.40, libopen-rte.so.40, libhwloc.so.15, libz.so.1, libudev.so.1.

Tested CPU Architectures

This benchmark has been successfully tested on the below mentioned architectures. The CPU architectures listed is where successful OpenBenchmarking.org result uploads occurred, namely for helping to determine if a given test is compatible with various alternative CPU architectures.

CPU Architecture
Kernel Identifier
Verified On
Intel / AMD x86 64-bit
x86_64
(Many Processors)
MIPS 64-bit
mips64
Loongson-3A3000 V0.9 FPU V0.1
Intel / AMD x86 32-bit
i686
(Many Processors)
ARMv7 32-bit
armv7l
ARMv7 Cortex-A7 4-Core, ARMv7 rev 1 4-Core