`moving_avg` forecasts should not include current point #11641

polyfractal · 2015-06-12T20:16:50Z

Forecasts should include all points up to, but not including, the current point.
Fixes some tests
Removes the "Gap" tests, which have proven to be super fragile due to accumulated edge cases, and aren't even very useful anymore because the mockHisto stuff generates randomly sized gaps.
Removes the concrete implementation of predict(), which makes things simpler / more intuitive
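
To make the intended window semantics concrete, here is a minimal sketch rather than the code from this PR. The class and method names (`ForecastWindowSketch`, `averagesExcludingCurrent`) are hypothetical, and it uses a plain unweighted average for illustration. The point is only that the estimate for position `i` is computed before the value at `i` enters the window.

```java
import java.util.ArrayDeque;
import java.util.Deque;

// Hypothetical illustration; not part of the Elasticsearch code base.
final class ForecastWindowSketch {

    /**
     * For each index i, returns the unweighted average of up to `window`
     * values strictly before i. The value at i never contributes to its
     * own estimate. The first entry is NaN because nothing precedes it.
     */
    static double[] averagesExcludingCurrent(double[] series, int window) {
        double[] out = new double[series.length];
        Deque<Double> buffer = new ArrayDeque<>();
        double sum = 0.0;
        for (int i = 0; i < series.length; i++) {
            // Compute the estimate BEFORE the current value enters the window.
            out[i] = buffer.isEmpty() ? Double.NaN : sum / buffer.size();
            buffer.addLast(series[i]);
            sum += series[i];
            if (buffer.size() > window) {
                sum -= buffer.removeFirst(); // evict the oldest value
            }
        }
        return out;
    }
}
```

With the series `{1, 2, 3, 10}` and a window of 3, the estimate at the last position is the average of `1, 2, 3`; the spike of `10` does not influence it.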

uboness · 2015-06-13T02:06:56Z

core/src/main/java/org/elasticsearch/search/aggregations/pipeline/movavg/models/EwmaModel.java

+        double[] predictions = new double[numPredictions];
+
+        // If there are no values, we can't do anything.  Return an array of NaNs.
+        if (values.size() == 0) {


values.isEmpty()

colings86 · 2015-06-15T07:36:35Z

.../src/main/java/org/elasticsearch/search/aggregations/pipeline/movavg/models/LinearModel.java

@@ -41,6 +42,29 @@
    protected static final ParseField NAME_FIELD = new ParseField("linear");

    @Override
+    public <T extends Number> double[] predict(Collection<T> values, int numPredictions) {


The two checks (values.size == 0 and numPredictions < 1) seem to be done in all the models, maybe we should move them to the super class and have the sub-classes implement doPredict() which is called from the predict() method in the super class?

in fact, to make all the models work in the same way, we could make MovAvgModel declare the following abstract method:

public abstract <T extends Number> double[] next(Collection<T> values, int numForecasts);

Then the predict and next methods can just be declared in MovAvgModel and be the following:

public final <T extends Number> double[] predict(Collection<T> values, int numPredictions) {
    return next(values, numPredictions);
}

public final <T extends Number> double next(Collection<T> values) {
    return next(values, 1)[0];
}

polyfractal · 2015-06-18

I went with the first suggestion, rolling the checks/assertion into predict(), which then calls down to doPredict().

I couldn't do the second suggestion, because not all the models follow that pattern. Simple / Linear / Ewma only know how to forecast one value at a time, i.e. they always return a single value, not an array. That also means their prediction code needs to fill the array of predictions with that last value, whereas Holt-linear / Holt-Winters (HL/HW) generate new predictions at each point.

If we want less copy/pasted code, going back to the old solution is probably best (a concrete predict() for simple/linear/ewma, with HL/HW overriding it).
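
For readers following the thread, the predict()/doPredict() split discussed above can be sketched roughly as follows. This is an illustration under stated assumptions, not the merged code: the class names (`MovAvgModelSketch`, `SimpleModelSketch`, `TrendModelSketch`) are hypothetical, throwing `IllegalArgumentException` for `numPredictions < 1` is an assumption, and the trend model is a deliberately crude stand-in for HL/HW that only shows per-step forecasts differing.

```java
import java.util.Arrays;
import java.util.Collection;

// Hypothetical sketch of the template-method split; not the actual MovAvgModel.
abstract class MovAvgModelSketch {

    /** Shared guards live here; subclasses only implement doPredict(). */
    public final <T extends Number> double[] predict(Collection<T> values, int numPredictions) {
        if (numPredictions < 1) {
            // Assumption: the real code may use a different check or assertion.
            throw new IllegalArgumentException("numPredictions must be >= 1");
        }
        if (values.isEmpty()) {
            // No values to work with: return an array of NaNs.
            double[] empty = new double[numPredictions];
            Arrays.fill(empty, Double.NaN);
            return empty;
        }
        return doPredict(values, numPredictions);
    }

    protected abstract <T extends Number> double[] doPredict(Collection<T> values, int numPredictions);
}

/** One-value-at-a-time model: every forecast slot gets the same average. */
final class SimpleModelSketch extends MovAvgModelSketch {
    @Override
    protected <T extends Number> double[] doPredict(Collection<T> values, int numPredictions) {
        double sum = 0.0;
        for (T v : values) {
            sum += v.doubleValue();
        }
        double[] predictions = new double[numPredictions];
        Arrays.fill(predictions, sum / values.size());
        return predictions;
    }
}

/** Crude stand-in for a trend-aware model: each forecast step differs. */
final class TrendModelSketch extends MovAvgModelSketch {
    @Override
    protected <T extends Number> double[] doPredict(Collection<T> values, int numPredictions) {
        double first = Double.NaN;
        double last = Double.NaN;
        for (T v : values) {
            if (Double.isNaN(first)) {
                first = v.doubleValue();
            }
            last = v.doubleValue();
        }
        // Average step between first and last value; zero for a single value.
        double slope = values.size() > 1 ? (last - first) / (values.size() - 1) : 0.0;
        double[] predictions = new double[numPredictions];
        for (int k = 0; k < numPredictions; k++) {
            predictions[k] = last + (k + 1) * slope;
        }
        return predictions;
    }
}
```

The contrast between the two subclasses is the reason a single `next(values, n)` contract is awkward: the first kind of model has one value to repeat, while the second has to extend its state at every step.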

polyfractal · 2015-06-18T19:35:39Z

@uboness @colings86 Sorry for the delay, fixes applied.

As an aside, the work I've been doing on the optimizer is going to need a fair amount of change to the next() and predict() interfaces. The current setup just isn't going to be ergonomic enough to deal with an optimizer changing model parameters constantly.

colings86 · 2015-06-22T08:35:29Z

LGTM

…apoint.
- Fixes tests, and removes a few special snowflake, fragile tests.
- Removes concrete implementation of predict() and moves it into each model so that the logic is clearer. Because there are some shared checks/assertions, those remain in predict() and the main prediction happens in doPredict()

Aggregations: Moving average forecasts should not include current point

polyfractal added v2.0.0-beta1 :Analytics/Aggregations Aggregations labels Jun 12, 2015

uboness reviewed Jun 13, 2015

clintongormley changed the title ~~Aggregations: moving_avg forecasts should not include current point~~ moving_avg forecasts should not include current point Jun 13, 2015

clintongormley added the >bug label Jun 13, 2015

colings86 reviewed Jun 15, 2015

polyfractal force-pushed the bugfix/movavg_predict branch from 9280da5 to 513a673 June 18, 2015 19:32

polyfractal force-pushed the bugfix/movavg_predict branch from 513a673 to 5d94feb June 22, 2015 15:16

polyfractal added a commit that referenced this pull request Jun 22, 2015

Merge pull request #11641 from polyfractal/bugfix/movavg_predict

2ebb44d

Aggregations: Moving average forecasts should not include current point

polyfractal merged commit 2ebb44d into elastic:master Jun 22, 2015
