[ML] Improve initialisation of the residual model after detecting new decomposition components #218

tveasey · 2018-09-27T11:15:43Z

Currently we use a small random sample of historical values to initialise the prediction residual model after we've detected new components of the time series decomposition. This sample is not large enough to be reliably representative of true variation we should expect and can occasionally lead to spurious anomalies immediately after a component is detected. As a result of the increased sensitivity related to #181 this has become more important.

We actually have a better sample available in the window of values we use to perform decomposition (albeit aggregated at a different time scale than the bucketing interval). This change switches to use these values instead to reinitialise the residual model.

dimitris-athanasiou

Looks good! Left a few minor comments.

dimitris-athanasiou · 2018-09-27T13:46:43Z

include/maths/CTimeSeriesDecomposition.h

    scale(core_t::TTime time, double variance, double confidence, bool smooth = true) const;

+    //! Get the values in a recent time window if they are available.
+    virtual TTimeDoublePrVec windowValues(core_t::TTime time, bool forced = false) const;


Could you document what forcing means here?

On reflection I think a slight refactor makes this easier to understand. I factored out the logic to test to see if the components might be added so this now always returns the decomposition window values. See this commit.

dimitris-athanasiou · 2018-09-27T13:49:15Z

lib/maths/CTimeSeriesModel.cc

 const std::string IS_NON_NEGATIVE_6_3_TAG{"b"};
 const std::string IS_FORECASTABLE_6_3_TAG{"c"};
-const std::string RNG_6_3_TAG{"d"};
+//const std::string RNG_6_3_TAG{"d"};


Do we usually just comment out removed tags? Might be better to include a message that includes the version of removal.

dimitris-athanasiou · 2018-09-27T13:50:08Z

lib/maths/CTimeSeriesModel.cc

    seed = CChecksum::calculate(seed, m_CurrentChangeInterval);
    seed = CChecksum::calculate(seed, m_ChangeDetector);
-    seed = CChecksum::calculate(seed, m_RecentSamples);
+    seed = CChecksum::calculate(seed, m_AnomalyModel);


This looks like it's being calculated twice now in this function

tveasey · 2018-09-27T17:52:27Z

@dimitris-athanasiou, I addressed your comments in my last commit. Can you take another look?

dimitris-athanasiou

LGTM The refactoring made things clearer indeed. Left another comment for explaining a hard number but it's good to go.

dimitris-athanasiou · 2018-09-28T08:26:24Z

lib/maths/CTimeSeriesDecompositionDetail.cc

+bool CTimeSeriesDecompositionDetail::CPeriodicityTest::shouldTest(ETest test,
+                                                                  core_t::TTime time) const {
+    // We need to test more frequently than we compress because it
+    // only happens each 336 buckets and would significantly delay


Where is this 336 coming from? It would be nice to explain that too in the comment.

Added an explanation in this commit.

… decomposition components (elastic#218)

…g new decomposition components (#223) Backport #218.

Initialise new components from the values in the decomposition window

70b134d

tveasey added >enhancement v7.0.0 review :ml affects-results v6.5.0 labels Sep 27, 2018

tveasey requested a review from dimitris-athanasiou September 27, 2018 11:15

Documentation

97e95b9

dimitris-athanasiou reviewed Sep 27, 2018

View reviewed changes

Refactor windowValues to improve code clarity. Some extra comments.

01ff263

dimitris-athanasiou approved these changes Sep 28, 2018

View reviewed changes

tveasey force-pushed the enhancement/new-component-initialisation branch 2 times, most recently from 25ee047 to 5cc6d6c Compare September 28, 2018 08:50

Add explaining comment

7bc3e87

tveasey force-pushed the enhancement/new-component-initialisation branch from 5cc6d6c to 7bc3e87 Compare September 28, 2018 09:32

tveasey merged commit 3f7eb0c into elastic:master Sep 28, 2018

tveasey changed the title ~~[ML] Improve initialisation of the residual model after detecting new decomposition component~~ [ML] Improve initialisation of the residual model after detecting new decomposition components Sep 28, 2018

tveasey added a commit to tveasey/ml-cpp-1 that referenced this pull request Sep 28, 2018

[ML] Improve initialisation of the residual model after detecting new…

6f5aa09

… decomposition components (elastic#218)

tveasey mentioned this pull request Oct 1, 2018

[6.5][ML] Improve initialisation of the residual model after detecting new decomposition components #223

Merged

tveasey added a commit that referenced this pull request Oct 1, 2018

[6.5][ML] Improve initialisation of the residual model after detectin…

6ebee46

…g new decomposition components (#223) Backport #218.

tveasey mentioned this pull request Oct 17, 2018

[ML] Account for step discontinuities when reinitialising the residual model after a detecting a change or new decomposition components #260

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ML] Improve initialisation of the residual model after detecting new decomposition components #218

[ML] Improve initialisation of the residual model after detecting new decomposition components #218

Uh oh!

tveasey commented Sep 27, 2018

Uh oh!

dimitris-athanasiou left a comment

Uh oh!

dimitris-athanasiou Sep 27, 2018

Uh oh!

tveasey Sep 27, 2018

Uh oh!

dimitris-athanasiou Sep 27, 2018

Uh oh!

dimitris-athanasiou Sep 27, 2018

Uh oh!

tveasey commented Sep 27, 2018

Uh oh!

dimitris-athanasiou left a comment

Uh oh!

dimitris-athanasiou Sep 28, 2018

Uh oh!

tveasey Sep 28, 2018 •

edited

Loading

Uh oh!

Uh oh!

[ML] Improve initialisation of the residual model after detecting new decomposition components #218

[ML] Improve initialisation of the residual model after detecting new decomposition components #218

Uh oh!

Conversation

tveasey commented Sep 27, 2018

Uh oh!

dimitris-athanasiou left a comment

Choose a reason for hiding this comment

Uh oh!

dimitris-athanasiou Sep 27, 2018

Choose a reason for hiding this comment

Uh oh!

tveasey Sep 27, 2018

Choose a reason for hiding this comment

Uh oh!

dimitris-athanasiou Sep 27, 2018

Choose a reason for hiding this comment

Uh oh!

dimitris-athanasiou Sep 27, 2018

Choose a reason for hiding this comment

Uh oh!

tveasey commented Sep 27, 2018

Uh oh!

dimitris-athanasiou left a comment

Choose a reason for hiding this comment

Uh oh!

dimitris-athanasiou Sep 28, 2018

Choose a reason for hiding this comment

Uh oh!

tveasey Sep 28, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

tveasey Sep 28, 2018 •

edited

Loading