[ML] Improvements to upfront memory estimation for data frame analyses #1003

tveasey · 2020-02-17T10:36:54Z

This change fixes a double-counting bug in the memory estimated for the extra columns used by classification and regression model training. It also ensures they count only towards the data frame memory usage; previously they were wrongly treated as features for memory estimation purposes.

Incidentally, it also fixes the memory reported by the counter E_DFOEstimatedPeakMemoryUsage, which was missing the extra columns' memory usage.

Finally, it tidies up the instrumentation of outlier detection to make fuller use of the new instrumentation class, and corrects the memory estimates in testRunOutlierDetectionPartitioned.
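
To make the double counting described in the first paragraph concrete, here is a minimal sketch. It is an illustration under stated assumptions, not the ml-cpp estimator: the struct, the function names, the 4-byte value size, and the flat per-feature training cost are all hypothetical and do not appear in this PR.

```cpp
#include <cstddef>

// Hypothetical description of a data frame; not an ml-cpp type.
struct FrameShape {
    std::size_t rows;
    std::size_t featureColumns; // columns the model actually trains on
    std::size_t extraColumns;   // bookkeeping columns appended for training
};

// Assumption: each stored value occupies 4 bytes.
constexpr std::size_t BYTES_PER_VALUE = 4;

// The data frame stores every column, whether feature or extra.
constexpr std::size_t dataFrameBytes(const FrameShape& shape) {
    return shape.rows * (shape.featureColumns + shape.extraColumns) * BYTES_PER_VALUE;
}

// Assumption: training memory grows linearly with the number of features.
constexpr std::size_t trainingBytes(std::size_t features, std::size_t bytesPerFeature) {
    return features * bytesPerFeature;
}

// Illustrates the two mistakes: the extra columns are added a second time,
// and they are also passed to the training term as if they were features.
constexpr std::size_t buggyEstimate(const FrameShape& shape, std::size_t bytesPerFeature) {
    return dataFrameBytes(shape)
           + shape.rows * shape.extraColumns * BYTES_PER_VALUE
           + trainingBytes(shape.featureColumns + shape.extraColumns, bytesPerFeature);
}

// Extra columns count once, and only towards the data frame.
constexpr std::size_t fixedEstimate(const FrameShape& shape, std::size_t bytesPerFeature) {
    return dataFrameBytes(shape) + trainingBytes(shape.featureColumns, bytesPerFeature);
}
```

With 1000 rows, 10 feature columns, 2 extra columns, and an assumed 4096 bytes of training memory per feature, the fixed estimate is 88960 bytes and the buggy one is 105152 bytes. The difference is the extra columns counted a second time (8000 bytes) plus two phantom features (8192 bytes).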

tveasey · 2020-02-17T20:11:34Z

retest

droberts195

LGTM


tveasey added 2 commits February 17, 2020 10:18

Memory estimation and monitoring improvements

7bae721

More testing

b66de57

tveasey added the >bug, review, v8.0.0, :ml/DataFrameAnalysis, and v7.7.0 labels Feb 17, 2020

Docs

220459b

tveasey mentioned this pull request Feb 17, 2020

[ML] Improvements to regression and classification memory handling #995

Closed

4 tasks

tveasey added 2 commits February 17, 2020 12:10

Test updates

81c3793

Relax thresholds for unix

20d6afb

tveasey mentioned this pull request Feb 17, 2020

[ML] Mute integration test which fails after improving upfront memory estimation for data frame analyses elastic/elasticsearch#52434

Merged

droberts195 approved these changes Feb 18, 2020

tveasey merged commit 6750684 into elastic:master Feb 18, 2020

tveasey deleted the memory-bugs branch February 18, 2020 12:48

tveasey added a commit to tveasey/ml-cpp-1 that referenced this pull request Feb 18, 2020

[ML] Improvements to upfront memory estimation for data frame analyses (elastic#1003)

d12416f

tveasey mentioned this pull request Feb 18, 2020

[7.7][ML] Improvements to upfront memory estimation for data frame analyses #1010

Merged

tveasey added a commit that referenced this pull request Feb 18, 2020

[7.7][ML] Improvements to upfront memory estimation for data frame analyses (#1010)

Backport #1003.

e5b32d4

pheyos mentioned this pull request Feb 25, 2020

[ML] Functional tests - adjust classification model memory elastic/kibana#58445

Merged
