add async trainModel #81

wnbts · 2020-04-08T23:06:54Z

This change adds a new async trainModel implementation with the same business logic to replace the current synchronous implementation.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

kaituo · 2020-04-09T18:45:41Z

src/main/java/com/amazon/opendistroforelasticsearch/ad/ml/ModelManager.java

+        }
+    }
+
+    private void trainModelForIteration(


iteration means some repeated steps. Suggest to rename to step.

kaituo · 2020-04-09T18:49:09Z

src/main/java/com/amazon/opendistroforelasticsearch/ad/ml/ModelManager.java

+                Entry<Integer, Integer> partitionResults = getPartitionedForestSizes(
+                    RandomCutForest
+                        .builder()
+                        .dimensions(rcfNumFeatures)
+                        .sampleSize(rcfNumSamplesInTree)
+                        .numberOfTrees(rcfNumTrees)
+                        .outputAfter(rcfNumSamplesInTree)
+                        .parallelExecutionEnabled(false)
+                        .build(),
+                    anomalyDetector.getDetectorId()
+                );


I changed this in another PR: https://github.com/opendistro-for-elasticsearch/anomaly-detection/pull/83/files#diff-0ba3da6c04a6db2df8146de98b12d850

This is to have a single place to get the number of partitioned forests. Previously, we have redundant code in both ModelManager and ADStateManager.

If you agree, please use the changed getPartitionedForestSizes.

the changes in that pr are currently unavailable in dev branch. if that is checked in first, this pr can be updated based on that. Or if this pr is checked in first, the refactoring can be done in a separate pr.

fair enough.

ylwu-amzn · 2020-04-10T06:04:14Z

src/main/java/com/amazon/opendistroforelasticsearch/ad/ml/ModelManager.java

+    *                 onFailure is called IllegalArgumentException when training data is invalid
+    *                 onFailure is called LimitExceededException when a limit for training is exceeded
+    */
+    public void trainModel(AnomalyDetector anomalyDetector, double[][] dataPoints, ActionListener<Void> listener) {


Just transform the sync method to callback, not change any logic, right?

yes, changed to async inside out

ylwu-amzn

LGTM. Thanks for the change.

wnbts marked this pull request as ready for review April 8, 2020 23:20

kaituo reviewed Apr 9, 2020

View reviewed changes

add async trainModel

7f4f515

kaituo approved these changes Apr 9, 2020

View reviewed changes

ylwu-amzn reviewed Apr 10, 2020

View reviewed changes

ylwu-amzn approved these changes Apr 10, 2020

View reviewed changes

wnbts merged commit f851f1a into opendistro-for-elasticsearch:development Apr 10, 2020

kaituo pushed a commit to kaituo/anomaly-detection that referenced this pull request Apr 13, 2020

add async trainModel (opendistro-for-elasticsearch#81)

d023166

wnbts deleted the mm1-train branch September 11, 2020 01:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add async trainModel #81

add async trainModel #81

wnbts commented Apr 8, 2020 •

edited

kaituo Apr 9, 2020

wnbts Apr 9, 2020

kaituo Apr 9, 2020

wnbts Apr 9, 2020

kaituo Apr 9, 2020

ylwu-amzn Apr 10, 2020

wnbts Apr 10, 2020

ylwu-amzn left a comment

add async trainModel #81

add async trainModel #81

Conversation

wnbts commented Apr 8, 2020 • edited

kaituo Apr 9, 2020

Choose a reason for hiding this comment

wnbts Apr 9, 2020

Choose a reason for hiding this comment

kaituo Apr 9, 2020

Choose a reason for hiding this comment

wnbts Apr 9, 2020

Choose a reason for hiding this comment

kaituo Apr 9, 2020

Choose a reason for hiding this comment

ylwu-amzn Apr 10, 2020

Choose a reason for hiding this comment

wnbts Apr 10, 2020

Choose a reason for hiding this comment

ylwu-amzn left a comment

Choose a reason for hiding this comment

wnbts commented Apr 8, 2020 •

edited