[ENH] Add STRAY anomaly detection #3337

KatieBuc · 2022-08-25T07:34:38Z

Add STRAY (Search TRace AnomalY) anomaly detection [1].

"The HDoutliers [2] algorithm is a powerful unsupervised algorithm for detecting anomalies in high-dimensional data, with a strong theoretical foundation. However, it suffers from some limitations that significantly hinder its performance level, under certain circumstances. In this article, we propose an algorithm that addresses these limitations. We define an anomaly as an observation where its k-nearest neighbor distance with the maximum gap is significantly different from what we would expect if the distribution of k-nearest neighbors with the maximum gap is in the maximum domain of attraction of the Gumbel distribution. An approach based on extreme value theory is used for the anomalous threshold calculation. Using various synthetic and real datasets, we demonstrate the wide applicability and usefulness of our algorithm, which we call the stray algorithm. "

This task involves re-writing this R code, as an estimator in the annotation module.

References:

Talagala, Priyanga Dilini, Rob J. Hyndman, and Kate Smith-Miles. "Anomaly detection in high-dimensional data." Journal of Computational and Graphical Statistics 30.2 (2021): 360-374.
Wilkinson, Leland. "Visualizing big data outliers through distributed aggregation." IEEE transactions on visualization and computer graphics 24.1 (2017): 256-266.

lmmentel · 2022-09-26T10:33:01Z

closed by #3338

KatieBuc added the enhancement Adding new functionality label Aug 25, 2022

KatieBuc mentioned this issue Aug 25, 2022

[ENH] STRAY anomaly detection #3338

Merged

lmmentel added this to Under review in Workstream: annotation, segmentation Sep 5, 2022

lmmentel added the module:annotation label Sep 13, 2022

lmmentel assigned KatieBuc Sep 13, 2022

lmmentel closed this as completed Sep 26, 2022

lmmentel moved this from Under review to Done in Workstream: annotation, segmentation Sep 26, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ENH] Add STRAY anomaly detection #3337

[ENH] Add STRAY anomaly detection #3337

KatieBuc commented Aug 25, 2022

lmmentel commented Sep 26, 2022

[ENH] Add STRAY anomaly detection #3337

[ENH] Add STRAY anomaly detection #3337

Comments

KatieBuc commented Aug 25, 2022

lmmentel commented Sep 26, 2022