[ML] Fix for "No counts available" error message #2414

edsavage · 2022-10-28T10:56:01Z

When restarting a look-back job as a real-time one, error messages similar to the following are occasionally seen:

[CBucketGatherer.cc@464] No counts available at 1585698300, current bucket = [1585699200,1585700100)

Investigation shows that these errors occur when an incomplete initial bucket for a partition was persisted at the close of the look-back job. When the job is re-opened as a real-time job, the start time of the affected anomaly detector is still earlier than that of its associated bucket gatherer, which triggers the error.

This PR adds a check that determines whether an incomplete initial bucket has been restored from persisted state and, if so, skips any attempt to output results for that bucket. This is essentially what happened before this change, except that the error is no longer logged.
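
As a rough illustration of the check described above (not the code in this PR), the following minimal sketch models a bucket gatherer by its current bucket only. The names `SBucketGatherer`, `hasCountsAt` and `shouldOutputResults`, and the boolean flag standing in for "an incomplete initial bucket was restored", are all hypothetical. The timestamps are the ones from the error message.

```cpp
#include <cstdint>

using TTime = std::int64_t;

// Hypothetical, simplified stand-in for a bucket gatherer: it only
// knows about its current bucket [start, start + length).
struct SBucketGatherer {
    TTime s_CurrentBucketStart;
    TTime s_BucketLength;
};

// True if the gatherer holds counts for the given time, i.e. the time
// falls inside the gatherer's current bucket.
bool hasCountsAt(const SBucketGatherer& gatherer, TTime time) {
    return time >= gatherer.s_CurrentBucketStart &&
           time < gatherer.s_CurrentBucketStart + gatherer.s_BucketLength;
}

// Assumed form of the check: if an incomplete initial bucket was
// restored from persisted state and the requested time precedes the
// gatherer's current bucket, skip the output attempt instead of
// proceeding to a lookup that would log "No counts available".
bool shouldOutputResults(const SBucketGatherer& gatherer,
                         TTime time,
                         bool restoredIncompleteInitialBucket) {
    if (restoredIncompleteInitialBucket && time < gatherer.s_CurrentBucketStart) {
        return false;
    }
    return true;
}
```

With the values from the log line, the requested time 1585698300 lies one 900 second bucket before the gatherer's current bucket, so the sketch skips it when the flag is set and would otherwise fall through to the failing lookup.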

Fixes #2411


droberts195 · 2022-10-28T13:28:21Z

include/api/CAnomalyJob.h

@@ -496,8 +504,14 @@ class API_EXPORT CAnomalyJob : public CDataProcessor {
    //! Flag indicating whether or not time has been advanced.
    bool m_TimeAdvanced{false};

+    //! The initial value of the end time of the last bucket
+    //! out of latency window we've seen
+    core_t::TTime m_InitialLastFinalisedBucketEndTime{0};


Please expand the comment to say how this works for jobs that ran successfully for many buckets before being persisted by a version earlier than 8.6.

As far as I can see, in this scenario this variable stays as 0 forever. Is that right? I think that’s OK because we don’t need the functionality for a job that ran successfully for many buckets. But even if I am right, it’s really important to document that this variable cannot be assumed to be non-zero and, because of this, must not be used for any other purpose.

Or if I am wrong about it staying zero forever for previously successful jobs, how does it work?
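
To make the scenario in this comment concrete, here is a minimal sketch of the behaviour the reviewer describes. It assumes that state written by an earlier version simply lacks the field, so the restored value keeps its default of 0. The map-based state, the tag name and both function names are hypothetical and are not taken from the ml-cpp code.

```cpp
#include <cstdint>
#include <map>
#include <string>

using TTime = std::int64_t;
using TStrStrMap = std::map<std::string, std::string>;

// Hypothetical tag under which the value would be persisted.
const std::string INITIAL_END_TIME_TAG{"initial_last_finalised_bucket_end_time"};

// Restore the value from persisted state. State written by a version
// that predates the field has no such entry, so the default of 0 is
// returned and never changes afterwards.
TTime restoreInitialLastFinalisedBucketEndTime(const TStrStrMap& persistedState) {
    auto itr = persistedState.find(INITIAL_END_TIME_TAG);
    if (itr == persistedState.end()) {
        return 0;
    }
    return std::stoll(itr->second);
}

// Callers must treat 0 as "unknown" rather than as a real bucket end
// time, which is why the value is unsuitable for any other purpose.
bool isInitialEndTimeKnown(TTime initialLastFinalisedBucketEndTime) {
    return initialLastFinalisedBucketEndTime != 0;
}
```

Under these assumptions, a job restored from older state reports the value as unknown, and any logic keyed on it is simply bypassed.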

Expanding on documentation of m_InitialLastFinalisedBucketEndTime.

tveasey

LGTM

edsavage added >bug review :ml v8.6.0 labels Oct 28, 2022

edsavage requested review from droberts195 and tveasey October 28, 2022 10:56

edsavage added 2 commits October 28, 2022 11:57

Update change log

56d6436

Fix formatting errors

3ff7d1d

droberts195 reviewed Oct 28, 2022


edsavage added 2 commits October 28, 2022 15:41

Attend to review comments.

fd41ab1

Expanding on documentation of m_InitialLastFinalisedBucketEndTime.

Typo

76d25e1

tveasey approved these changes Oct 31, 2022


edsavage merged commit baff94f into elastic:main Nov 1, 2022

edsavage deleted the no_counts_available branch November 1, 2022 14:49
