[ML] Early stopping in the line searches to compute initial regulariser values #903
Conversation
Looks good altogether. Just a few minor comments.
BOOST_REQUIRE_CLOSE(c1, prediction, 3.0);
BOOST_REQUIRE_SMALL(c2, 0.25);
BOOST_REQUIRE_SMALL(c3, 0.25);
BOOST_REQUIRE_SMALL(c4, 0.25);
Thank you for catching the error in the BOOST_REQUIRE_CLOSE call. Why did the values c2, ..., c4 become so much larger?
These numbers are slightly unstable. The reason they were 0 before is that none of these features were selected at all for training because of the threshold on MIC. Once they get selected there is scope for them to be non-zero. Since these are still so much smaller than c1, 0.25 vs ~100, I figured it was ok to just allow some leeway.
👍
#include <maths/CSampling.h>
#include <maths/CTools.h>

#include <boost/math/distributions/normal.hpp>
#include <boost/optional/optional_io.hpp>
do you need this include?
This is needed to print out boost::optional on this line.
The header boost/optional.hpp is already included in the header file. Is it still required here? Line 177 doesn't use optional, as far as I can see.
On this line I've added expectedImprovement, which has type boost::optional<double>. This print statement doesn't compile if I don't include optional_io.hpp:
LOG_TRACE(<< "best = " << xmax.cwiseProduct(m_MaxBoundary - m_MinBoundary).transpose()
          << " EI(best) = " << expectedImprovement);
Unfortunately, the print functionality is not included by optional.hpp.
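For illustration, a minimal standalone sketch of the compile-time issue (the snippet and the value of expectedImprovement are hypothetical, not taken from the PR):

#include <boost/optional.hpp>
#include <boost/optional/optional_io.hpp> // provides operator<< for boost::optional
#include <iostream>

int main() {
    boost::optional<double> expectedImprovement{0.42};
    // Without optional_io.hpp the next line fails to compile:
    // boost/optional.hpp alone does not define a stream insertion
    // operator for boost::optional.
    std::cout << "EI(best) = " << expectedImprovement << '\n';
    return 0;
}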
minTestLoss.add(testLoss);
testLosses.emplace_back(regularizer, testLoss);
}
while (testLosses.size() > 0 && testLosses.size() < MAX_LINE_SEARCH_ITERATIONS) {
Do you need to check for the first condition?
Yes. This can happen in the loop which searches for the best downsample factor, i.e. applyRegularizer can return false for all values in the loop:
for (auto regularizer :
     {intervalLeftEnd, (2.0 * intervalLeftEnd + intervalRightEnd) / 3.0,
      (intervalLeftEnd + 2.0 * intervalRightEnd) / 3.0, intervalRightEnd}) {
    ...
In that case, we'll just use a downsample factor of 1, and without this condition we actually SIGSEGV when trying to compute the maximum expected loss.
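To see why the size() > 0 guard matters, here is a rough sketch; the helper, the type alias, and the fallback value are hypothetical, only the failure mode comes from the discussion above:

#include <algorithm>
#include <utility>
#include <vector>

using TDoubleDoublePrVec = std::vector<std::pair<double, double>>;

// Hypothetical helper: find the maximum test loss over the line search
// results. If applyRegularizer returned false for every candidate, the
// vector is empty and dereferencing std::max_element's end() iterator is
// undefined behaviour, hence the size() > 0 check in the loop condition.
double maximumTestLoss(const TDoubleDoublePrVec& testLosses) {
    if (testLosses.empty()) {
        return 0.0; // hypothetical fallback, mirroring "just use downsample factor 1"
    }
    return std::max_element(testLosses.begin(), testLosses.end(),
                            [](const auto& lhs, const auto& rhs) {
                                return lhs.second < rhs.second;
                            })->second;
}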
LGTM
// This has the following steps:
//   1. Coarse search the interval [intervalLeftEnd, intervalRightEnd] using
//      fixed steps,
//   2. Fine tune, via Bayesian Optimisation targeting expected improvement,
//      and stop if the expected improvement is small compared to the current
//      minimum test loss,
//   3. Calculate the parameter interval which gives the lowest test losses,
//   4. Fit an OLS quadratic approximation to the test losses in the interval
//      from step 3 and use it to estimate the best parameter value,
//   5. Compare the size of the residual errors w.r.t. the OLS curve from
//      step 4 with its variation over the interval from step 3 and truncate
//      the returned interval if we can determine there is a low chance of
//      missing the best solution by doing so.
nice! 👍
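To make step 4 above concrete, a standalone sketch of an OLS quadratic fit and vertex estimate might look like the following. Every name, signature, and fallback here is hypothetical illustration, not the PR's implementation; it assumes at least 3 distinct sample points so the normal equations are solvable:

#include <algorithm>
#include <array>
#include <utility>
#include <vector>

// Fit y = a + b*x + c*x^2 by ordinary least squares to the
// (parameter, test loss) pairs gathered during the line search.
std::array<double, 3> fitQuadraticOls(const std::vector<std::pair<double, double>>& points) {
    // Accumulate the moments which appear in the 3x3 normal equations.
    double s0{0}, s1{0}, s2{0}, s3{0}, s4{0}, t0{0}, t1{0}, t2{0};
    for (const auto& [x, y] : points) {
        double x2{x * x};
        s0 += 1.0; s1 += x;     s2 += x2; s3 += x2 * x; s4 += x2 * x2;
        t0 += y;   t1 += x * y; t2 += x2 * y;
    }
    // Solve by Cramer's rule, which is fine for a 3x3 system.
    auto det3 = [](double a, double b, double c,
                   double d, double e, double f,
                   double g, double h, double i) {
        return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g);
    };
    double det{det3(s0, s1, s2, s1, s2, s3, s2, s3, s4)};
    return {det3(t0, s1, s2, t1, s2, s3, t2, s3, s4) / det,
            det3(s0, t0, s2, s1, t1, s3, s2, t2, s4) / det,
            det3(s0, s1, t0, s1, s2, t1, s2, s3, t2) / det};
}

// The minimiser of a + b*x + c*x^2 is x = -b / (2c) when c > 0; clamp it
// to the searched interval.
double bestParameter(const std::array<double, 3>& abc, double left, double right) {
    double b{abc[1]}, c{abc[2]};
    if (c > 0.0) {
        return std::clamp(-b / (2.0 * c), left, right);
    }
    return left; // hypothetical fallback when the fit isn't convex
}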
This migrates to using Bayesian Optimisation for choosing the points at which to evaluate the regularisers in the line searches we perform to find their initial values. Importantly, it also thresholds the minimum expected improvement needed to continue with a line search at 1% of the current test loss. This saves up to 3 iterations in the loop (since we lower bound the minimum number of iterations we'll use), or around 40% of the cost of the line searches.
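The stopping rule amounts to something like the following standalone sketch; the names and the exact form of the check are hypothetical, only the 1% threshold comes from the description above:

#include <boost/optional.hpp>

// The 1% threshold described above; the surrounding names are illustrative.
const double MINIMUM_RELATIVE_EXPECTED_IMPROVEMENT{0.01};

// Continue the line search only while Bayesian Optimisation's expected
// improvement is worth at least 1% of the best test loss seen so far.
bool continueLineSearch(const boost::optional<double>& expectedImprovement,
                        double currentMinimumTestLoss) {
    if (expectedImprovement == boost::none) {
        // No estimate yet (e.g. too few evaluations): keep searching.
        return true;
    }
    return *expectedImprovement >
           MINIMUM_RELATIVE_EXPECTED_IMPROVEMENT * currentMinimumTestLoss;
}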