Numenta Anomaly Benchmark

Numenta Anomaly Benchmark (NAB) is a benchmark for evaluating algorithms for anomaly detection in streaming, real-time applications. It is comprised of over 50 labeled real-world and artificial time-series data files plus a novel scoring mechanism designed for real-time applications. This test profile currently measures the time to run various detectors.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark numenta-nab.

Project Site

github.com

Source Repository

github.com

Test Created

9 November 2018

Last Updated

14 December 2022

Test Maintainer

Michael Larabel 

Test Type

System

Average Install Time

57 Seconds

Average Run Time

18 Minutes, 10 Seconds

Test Dependencies

Python

Accolades

50k+ Downloads

Supported Platforms


Public Result Uploads *Reported Installs **Reported Test Completions **Test Profile Page Views ***OpenBenchmarking.orgEventsNumenta Anomaly Benchmark Popularity Statisticspts/numenta-nab2018.112019.012019.032019.052019.072019.092019.112020.012020.032020.052020.072020.092020.112021.012021.032021.052021.072021.092021.112022.012022.032022.052022.072022.092022.112023.012023.032023.052023.072023.092023.112024.012024.032024.052024.072024.094K8K12K16K20K
* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
*** Test profile page view reporting began March 2021.
Data updated weekly as of 12 October 2024.
KNN CAD6.7%Contextual Anomaly Detector OSE6.7%Windowed Gaussian18.1%EXPoSE7.0%Relative Entropy20.4%Earthgecko Skyline20.5%Bayesian Changepoint20.6%Detector Option PopularityOpenBenchmarking.org

Revision History

pts/numenta-nab-1.1.1   [View Source]   Wed, 14 Dec 2022 14:04:05 GMT
Fixes for running on modern Linux distributions.

pts/numenta-nab-1.1.0   [View Source]   Fri, 28 Feb 2020 12:04:17 GMT
Update against NAB 1.1 upstream for Python 3 compatibility.

pts/numenta-nab-1.0.0   [View Source]   Fri, 09 Nov 2018 11:55:56 GMT
Initial commit of Numenta Anomaly Benchmark

Suites Using This Test

Machine Learning

HPC - High Performance Computing

CPU Massive

Server CPU Tests

Python


Performance Metrics

Analyze Test Configuration:

Numenta Anomaly Benchmark 1.1

Detector: Bayesian Changepoint

OpenBenchmarking.org metrics for this test profile configuration based on 730 public results since 28 February 2020 with the latest data as of 3 October 2024.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component
Percentile Rank
# Compatible Public Results
Seconds (Average)
Mid-Tier
75th
> 26
Median
50th
34
49th
16
34 +/- 2
40th
3
37 +/- 1
38th
3
38
31st
4
44 +/- 1
29th
10
45 +/- 2
27th
3
49 +/- 1
Low-Tier
25th
> 51
23rd
3
57 +/- 7
21st
3
61 +/- 6
21st
4
62 +/- 1
20th
7
64 +/- 1
17th
6
71 +/- 1
12th
4
85 +/- 2
10th
4
92 +/- 1
4th
4
209 +/- 1
OpenBenchmarking.orgDistribution Of Public Results - Detector: Bayesian Changepoint730 Results Range From 10 To 1732 Seconds105192133174215256297338379420461502543584625666707748789830871912953994103510761117115811991240128113221363140414451486152715681609165016911732120240360480600

Based on OpenBenchmarking.org data, the selected test / test configuration (Numenta Anomaly Benchmark 1.1 - Detector: Bayesian Changepoint) has an average run-time of 4 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

OpenBenchmarking.orgMinutesTime Required To Complete BenchmarkDetector: Bayesian ChangepointRun-Time714212835Min: 1 / Avg: 3.41 / Max: 32

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 0.7%.

OpenBenchmarking.orgPercent, Fewer Is BetterAverage Deviation Between RunsDetector: Bayesian ChangepointDeviation246810Min: 0 / Avg: 0.74 / Max: 4

Does It Scale Well With Increasing Cores?

Yes, based on the automated analysis of the collected public benchmark data, this test / test settings does generally scale well with increasing CPU core counts. Data based on publicly available results for this test / test settings, separated by vendor, result divided by the reference CPU clock speed, grouped by matching physical CPU core count, and normalized against the smallest core count tested from each vendor for each CPU having a sufficient number of test samples and statistically significant data.

IntelAMDOpenBenchmarking.orgRelative Core Scaling To BaseNumenta Anomaly Benchmark CPU Core ScalingDetector: Bayesian Changepoint24681216182432641281.00832.01663.02494.03325.0415

Tested CPU Architectures

This benchmark has been successfully tested on the below mentioned architectures. The CPU architectures listed is where successful OpenBenchmarking.org result uploads occurred, namely for helping to determine if a given test is compatible with various alternative CPU architectures.

CPU Architecture
Kernel Identifier
Verified On
Intel / AMD x86 64-bit
x86_64
(Many Processors)
IBM POWER (PowerPC) 64-bit
ppc64le
POWER9 44-Core
ARMv7 32-bit
armv7l
ARMv7 Cortex-A53 4-Core, ARMv7 Cortex-A72 4-Core
ARMv8 64-bit
arm64
Apple M1
ARMv8 64-bit
aarch64
ARMv8 Cortex-A76 4-Core, ARMv8 Cortex-A78E 6-Core, ARMv8 Neoverse-N1, ARMv8 rev 0 8-Core, Ampere Altra ARMv8 Neoverse-N1 160-Core, Rockchip ARMv8 Cortex-A76 6-Core

Recent Test Results

OpenBenchmarking.org Results Compare

2 Systems - 195 Benchmark Results

2 Systems - 195 Benchmark Results

2 Systems - 195 Benchmark Results

2 Systems - 195 Benchmark Results

2 Systems - 195 Benchmark Results

2 Systems - 195 Benchmark Results

2 Systems - 195 Benchmark Results

2 Systems - 195 Benchmark Results

1 System - 5 Benchmark Results

AMD Ryzen 5 9600X 6-Core - Gigabyte B650 AORUS ELITE AX - AMD Device 14d8

NobaraLinux 40 - 6.11.1-202.fsync.fc40.x86_64 - KDE Plasma 6.1.4

1 System - 334 Benchmark Results

Intel Core i3-12100 - ASRock H610M-HDV/M.2 R2.0 - Intel Device 7aa7

Ubuntu 20.04 - 5.15.0-89-generic - GNOME Shell 3.36.9

1 System - 275 Benchmark Results

Intel Core i5-12500 - ASUS PRIME Z690M-HZ - Intel Alder Lake-S PCH

Debian 12 - 6.1.0-21-amd64 - GCC 12.2.0

Find More Test Results