[ML] Revisit correction for bucket length when computing probability quantiles normalising results #2276

tveasey · 2022-05-23T13:32:55Z

Currently, when computing probability quantiles as part of rate limiting normalised scores, we try to account for the fact that different bucket lengths produce a different number of results per time interval.

The assumption we make should result in roughly the same score distribution for different bucket lengths when modelling pure noise (I need to recheck this). However, for short bucket lengths it will tend to pull down scores when there are long-lasting anomalies in the data. This has probably been exacerbated by multi-bucket anomaly detection. In particular, we see cases where the correction has the opposite effect to the one we'd like: it produces less consistent scoring for different bucket lengths.
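To make the concern concrete, here is a minimal sketch of the arithmetic. It is not the ml-cpp implementation: the function names, the 1% base fraction, the one hour reference bucket length and the linear scaling of the allowed fraction are all assumptions chosen for illustration. It compares how many high scores a rate limit would permit over a window with how many buckets a single long-lasting anomaly covers.

```python
import math

HOUR = 3600
DAY = 24 * HOUR


def buckets_covered(anomaly_seconds, bucket_seconds):
    """Number of buckets spanned by an anomaly (assumes it starts on a bucket boundary)."""
    return math.ceil(anomaly_seconds / bucket_seconds)


def allowed_high_scores(window_seconds, bucket_seconds,
                        base_fraction=0.01, reference_seconds=HOUR):
    """Hypothetical rate limit: the fraction of buckets allowed a high score is
    scaled by bucket length so the count per unit time stays constant."""
    n_buckets = window_seconds // bucket_seconds
    corrected_fraction = base_fraction * bucket_seconds / reference_seconds
    return n_buckets * corrected_fraction


if __name__ == "__main__":
    window = 30 * DAY
    anomaly = 6 * HOUR
    for bucket_seconds in (HOUR, 300):
        print(bucket_seconds,
              allowed_high_scores(window, bucket_seconds),
              buckets_covered(anomaly, bucket_seconds))
```

Under these assumptions both bucket lengths permit about 7.2 high scores over 30 days, which is why pure noise would look similar. A 6 hour anomaly, however, covers 6 buckets at a one hour bucket length and 72 buckets at five minutes, so at the shorter bucket length most of its buckets would exceed the allowance and have their scores pulled down.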

We should revisit whether any sort of bucket length correction is a good idea when rate limiting normalised scores.

tveasey added the :ml label May 23, 2022

tveasey self-assigned this May 23, 2022

This was referenced May 24, 2022

[ML] Improve robustness to outliers after detecting changes in time series #2280

Merged

[ML] Improve normalisation of anomaly detection results for short bucket lengths #2285

Merged

tveasey closed this as completed in #2285 May 31, 2022
