Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ENH] Add STRAY anomaly detection #3337

Closed
KatieBuc opened this issue Aug 25, 2022 · 1 comment
Closed

[ENH] Add STRAY anomaly detection #3337

KatieBuc opened this issue Aug 25, 2022 · 1 comment
Assignees
Labels
enhancement Adding new functionality module:annotation

Comments

@KatieBuc
Copy link
Contributor

Add STRAY (Search TRace AnomalY) anomaly detection [1].

"The HDoutliers [2] algorithm is a powerful unsupervised algorithm for detecting anomalies in high-dimensional data, with a strong theoretical foundation. However, it suffers from some limitations that significantly hinder its performance level, under certain circumstances. In this article, we propose an algorithm that addresses these limitations. We define an anomaly as an observation where its k-nearest neighbor distance with the maximum gap is significantly different from what we would expect if the distribution of k-nearest neighbors with the maximum gap is in the maximum domain of attraction of the Gumbel distribution. An approach based on extreme value theory is used for the anomalous threshold calculation. Using various synthetic and real datasets, we demonstrate the wide applicability and usefulness of our algorithm, which we call the stray algorithm. "

This task involves re-writing this R code, as an estimator in the annotation module.

References:

  1. Talagala, Priyanga Dilini, Rob J. Hyndman, and Kate Smith-Miles. "Anomaly detection in high-dimensional data." Journal of Computational and Graphical Statistics 30.2 (2021): 360-374.
  2. Wilkinson, Leland. "Visualizing big data outliers through distributed aggregation." IEEE transactions on visualization and computer graphics 24.1 (2017): 256-266.
@lmmentel
Copy link
Contributor

closed by #3338

@lmmentel lmmentel moved this from Under review to Done in Workstream: annotation, segmentation Sep 26, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement Adding new functionality module:annotation
Development

No branches or pull requests

2 participants