[ML] Reduce false positives associated with the multi-bucket feature #491

tveasey · 2019-06-07T11:11:01Z

This improves the interaction between multi-bucket feature and the model we maintain attending specifically to periods flagged as anomalous. In particular, we were suppressing updates of this anomaly model if the bucket probability was high but continuing to adjust the overall probability if the multi-bucket feature probability was low. The upshot our scores were inappropriately high.

This change pegs the time used to compute the anomaly duration to the last update time for the anomaly model and also smoothly reduces the impact on our overall result if the bucket value itself is normal.

This should help with #477.

edsavage

LGTM - just one suggestion made inline

edsavage · 2019-06-07T11:41:19Z

lib/maths/CTimeSeriesModel.cc

+    bool anomalyProbability(double& result) const;
+
+    //! Get the largest probability the model counts as anomalous.
+    double largestAnomalyProbability() const {


A bit of a nit pick - feel free to disregard! - Rename this method to say twiceLargestSignificantProbability?

I think I'll leave this. It is the probability which is deemed sufficiently anomalous to update this model and although it is currently twice the cutoff probability to output a result there is nothing that requires this, it was just a reasonable value.

droberts195 · 2019-06-07T12:24:29Z

lib/maths/CTimeSeriesModel.cc

-    inserter.insertValue(VERSION_6_5_TAG, "");
+    inserter.insertValue(VERSION_7_2_TAG, "");
    if (m_Anomaly) {
        inserter.insertLevel(ANOMALY_6_5_TAG, boost::bind(&CAnomaly::acceptPersistInserter,


This is no longer restored, so is it worth persisting it? Or is it a mistake not to restore it?

I'd deleted the restore code from the wrong branch. Good catch.

droberts195 · 2019-06-07T12:31:34Z

Also, I'm not convinced it's a good idea to put this into 7.2.1. This change affects results and 7.2.1 is going to be a patch release that people will upgrade to expecting no change to functionality. Potentially Cloud clusters could be force-upgraded to 7.2.1 if there turns out to be a security bug in 7.2.0. Technically this is a bug fix, but if there's some set of circumstances where the anomaly detection is clearly worse after this change then we'll still have a very annoyed user.

Therefore I would say that either we prioritise testing this next Monday and Tuesday and get it into 7.2.0 or else it waits for 7.3.0.

And if it waits for 7.3.0 then the 7.2 constant in the code should be changed.

tveasey · 2019-06-07T13:42:29Z

Ok I agree this is on the fence regarding bug fix vs enhancement. I'm happy to delay until 7.3 and we can test more. I've updated the naming accordingly.

droberts195

LGTM

droberts195 · 2019-06-07T13:52:16Z

docs/CHANGELOG.asciidoc


 * Fix an edge case causing spurious anomalies (false positives) if the variance in the count of events
 changed significantly throughout the period of a seasonal quantity. (See {ml-pull}489[#489].)
+* Reduce false positives associated with the multi-bucket feature. (See {ml-pull}491[#491].)


Please move this to the 7.3.0 section.

Crazybus · 2019-06-07T15:00:17Z

jenkins test this please

…lastic#491)

…494) Backport #491.

Improve interaction between multi-buckets and anomaly model

3b2da0f

tveasey added review :ml affects-results v8.0.0 v7.2.1 labels Jun 7, 2019

tveasey requested a review from edsavage June 7, 2019 11:11

Docs

ad98bc3

edsavage approved these changes Jun 7, 2019

View reviewed changes

droberts195 reviewed Jun 7, 2019

View reviewed changes

droberts195 added the v7.3.0 label Jun 7, 2019

tveasey added 2 commits June 7, 2019 14:35

Fix state upgrade

eda0a21

Rename variables to reflect target version change

c9eb934

droberts195 removed the v7.2.1 label Jun 7, 2019

droberts195 approved these changes Jun 7, 2019

View reviewed changes

Initialisation error

7b45896

Update docs

19075f9

tveasey merged commit 967ab7a into elastic:master Jun 7, 2019

droberts195 mentioned this pull request Jun 10, 2019

AutodetectMemoryLimitIT#testTooManyPartitions fails with too large a model elastic/elasticsearch#43013

Closed

tveasey mentioned this pull request Jun 10, 2019

[ML] Reenable integration test and relax test tolerance for Linux elastic/elasticsearch#43031

Merged

tveasey added a commit to tveasey/ml-cpp-1 that referenced this pull request Jun 10, 2019

[ML] Reduce false positives associated with the multi-bucket feature (e…

dab327b

…lastic#491)

tveasey mentioned this pull request Jun 10, 2019

[7.3][ML] Reduce false positives associated with the multi-bucket feature #494

Merged

tveasey added a commit that referenced this pull request Jun 11, 2019

[ML] Reduce false positives associated with the multi-bucket feature (#…

204b1dd

…494) Backport #491.

tveasey mentioned this pull request Feb 26, 2020

[ML] False positives associated with multi-bucket features #477

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ML] Reduce false positives associated with the multi-bucket feature #491

[ML] Reduce false positives associated with the multi-bucket feature #491

Uh oh!

tveasey commented Jun 7, 2019

Uh oh!

edsavage left a comment

Uh oh!

edsavage Jun 7, 2019

Uh oh!

tveasey Jun 7, 2019

Uh oh!

droberts195 Jun 7, 2019

Uh oh!

tveasey Jun 7, 2019

Uh oh!

droberts195 commented Jun 7, 2019

Uh oh!

tveasey commented Jun 7, 2019

Uh oh!

droberts195 left a comment

Uh oh!

droberts195 Jun 7, 2019

Uh oh!

Crazybus commented Jun 7, 2019

Uh oh!

Uh oh!

[ML] Reduce false positives associated with the multi-bucket feature #491

[ML] Reduce false positives associated with the multi-bucket feature #491

Uh oh!

Conversation

tveasey commented Jun 7, 2019

Uh oh!

edsavage left a comment

Choose a reason for hiding this comment

Uh oh!

edsavage Jun 7, 2019

Choose a reason for hiding this comment

Uh oh!

tveasey Jun 7, 2019

Choose a reason for hiding this comment

Uh oh!

droberts195 Jun 7, 2019

Choose a reason for hiding this comment

Uh oh!

tveasey Jun 7, 2019

Choose a reason for hiding this comment

Uh oh!

droberts195 commented Jun 7, 2019

Uh oh!

tveasey commented Jun 7, 2019

Uh oh!

droberts195 left a comment

Choose a reason for hiding this comment

Uh oh!

droberts195 Jun 7, 2019

Choose a reason for hiding this comment

Uh oh!

Crazybus commented Jun 7, 2019

Uh oh!

Uh oh!