[ML] Address the root cause for "actual equals typical equals zero" anomalies #2270
Conversation
LGTM
The overall approach looks good. I don't know the code well enough to say whether the truncation has been applied everywhere it needs to be. Probably the biggest chance of a mistake in this PR is a line that hasn't been changed rather than one that has, but at least that would be no worse than now.
…lly calling the wrong implementation
We've had multiple issues reported in the past where we report high anomaly scores when the actual is the same as the typical and both are equal to zero. This has been exacerbated because we've also had some instabilities in the modelling when we stop receiving data for a partition, which was an edge case in which this could occur.
In all problematic cases, the underlying reason is that if we know the data are non-negative we truncate the prediction when we report the typical value. However, up until now we haven't truncated when computing probabilities.
This change switches to always truncating our prediction. (This is a halfway house to a more rigorous approach which would condition the predicted distribution to be non-negative.) This should be good enough to ensure we never create non-zero anomaly scores for cases where actual equals typical equals zero, except when the result is a multi-bucket anomaly.
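To make the idea concrete, here is a minimal sketch (with hypothetical names, not the actual ml-cpp API) of applying the same truncation both when reporting the typical value and when computing the probability. It assumes a simple Gaussian error model purely for illustration; the real code uses the model's predictive distribution.

```cpp
#include <algorithm>
#include <cmath>

// If the data are known to be non-negative, truncate the model's prediction
// at zero. Using this consistently for both the reported typical value and
// the probability calculation means actual == typical == 0 cannot score as
// anomalous.
double truncatedPrediction(double prediction, bool dataAreNonNegative) {
    return dataAreNonNegative ? std::max(prediction, 0.0) : prediction;
}

// Illustrative two-sided tail probability using the truncated prediction.
double anomalyProbability(double actual, double prediction, double sd,
                          bool dataAreNonNegative) {
    double typical = truncatedPrediction(prediction, dataAreNonNegative);
    if (dataAreNonNegative && actual == 0.0 && typical == 0.0) {
        // A zero actual against a zero typical is not surprising.
        return 1.0;
    }
    double z = std::abs(actual - typical) / std::max(sd, 1e-8);
    return std::erfc(z / std::sqrt(2.0));
}
```

Previously, the untruncated (possibly negative) prediction was used when computing the probability, so an actual of zero could look several standard deviations away from a typical that was also reported as zero.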