Naive prediction intervals #4127

eccabay · 2023-04-06T12:59:05Z

codecov · 2023-04-06T13:06:51Z

Codecov Report

Merging #4127 (eb86336) into main (8d49f35) will increase coverage by 0.1%.
The diff coverage is 100.0%.

@@           Coverage Diff           @@
##            main   #4127     +/-   ##
=======================================
+ Coverage   99.7%   99.7%   +0.1%     
=======================================
  Files        349     349             
  Lines      37704   37740     +36     
=======================================
+ Hits       37587   37623     +36     
  Misses       117     117

Impacted Files	Coverage Δ
evalml/pipelines/time_series_pipeline_base.py	`100.0% <100.0%> (ø)`
...valml/pipelines/time_series_regression_pipeline.py	`100.0% <100.0%> (ø)`
.../tests/pipeline_tests/test_time_series_pipeline.py	`100.0% <100.0%> (ø)`

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

* Remove drop_time_index from get_prediction_intervals * drop time index only for transform and not for predict * Maintain datetime frequency in drop_time_index * Make sure indices for residuals match

…ML-7064_naieve_PI

jeremyliweishih

Some questions

jeremyliweishih · 2023-04-07T17:21:34Z

evalml/pipelines/time_series_pipeline_base.py

        predictions = self._estimator_predict(features)
-        predictions.index = y.index
+        if not calculating_residuals:


why don't we want this when calculating_residuals?

So, what I've found and correct me if I'm wrong @eccabay , when we do a truly in-sample prediction the prediction size is not guaranteed to be the same size as the target values array...because of the lags I think.

I think I need to go back and do a little more experimentation here. You're right @chukarsten that the lags and dropping null rows means that the prediction size won't be the same size as the target values array. However, that's not the case if we've precomputed features with the DFS transformer, I had to add the index alignment @jeremyliweishih commented on to get this to work in the product. I'll do some experimenting to see if I can consolidate the two cases, and get something that makes a bit more sense.

I revisited this, it looks like a simpler way to keep this safe is to just limit setting the indices to be equal for when len(predictions) == len(y), it removes the need for the dfs check as well. I've updated accordingly but can always change it again if desired!

jeremyliweishih · 2023-04-07T17:22:55Z

evalml/pipelines/time_series_regression_pipeline.py

            )
+            if self.component_graph.has_dfs:


why do we only need to do this for the has_dfs case?

evalml/tests/pipeline_tests/test_time_series_pipeline.py

jeremyliweishih · 2023-04-07T17:30:32Z

evalml/pipelines/time_series_regression_pipeline.py

+
+            res_dict = {}
+            cov_to_mult = {0.75: 1.15, 0.85: 1.44, 0.95: 1.96}
+            for cov in coverage:


do we have explicit coverage re the math here? If not can we add it? If my understanding is correct, In test_time_series_pipeline_get_prediction_intervals it looks like we're only testing against the estimator prediction intervals which this implementation shouldn't be tested against.

I'm always hesitant about testing the math - it's too easy to just create a repeat of what we call in the function, which doesn't serve any purpose other than checking that we haven't changed our implementation. Do you have any suggestions for how to test the math?

We can test if the PIs logically make sense:

if the values are increasing as the number of predictions

if the values are increasing based off of the coverage that is selected

my main concern is that the test is asserting against the estimator PIs which we don't use anymore! Let me know what you think.

If we decide to go down that route, and I'm not super in love with it, I think we'd have to functionalize both of these branches and unit test them individually. I would want the tests to be about as simple as we could make them.

I'm not quite sure what you mean @chukarsten - I added some testing to check the math (ish), do you want to take a look and see what you think? I've been on the fence about splitting the current test up into two, so I'm also happy to do that if you'd prefer simpler tests.

jeremyliweishih

great work!

chukarsten

I think the only thing that's confusing to me is the parameterization of the test using features. Not sure what that's meant to say or what it's really testing. I think we need some additional explanation there for future devs coming down this road.

chukarsten · 2023-04-07T20:37:06Z

evalml/tests/pipeline_tests/test_time_series_pipeline.py

 @pytest.mark.parametrize("set_coverage", [True, False])
 @pytest.mark.parametrize("add_decomposer", [True, False])
 @pytest.mark.parametrize("no_preds_pi_estimator", [True, False])
+@pytest.mark.parametrize("features", [True, False])


We might need some inline comment coverage on these. It always takes me a while to understand no_preds_pi_estimator and what that's testing for. Now I'm a little confused what features being True or False means.

👍 I've renamed features to featuretools_first and no_preds_pi_estimator to ts_native_estimator and added a couple comments, let me know what you think!

chukarsten and others added 3 commits March 27, 2023 09:48

Initial attempt.

b60167f

Teests pass.

df06e23

Merge branch 'main' into TML-7064_naieve_PI

f66ce48

eccabay added 5 commits April 6, 2023 10:34

Time index fixes (#4128)

2dfd146

* Remove drop_time_index from get_prediction_intervals * drop time index only for transform and not for predict * Maintain datetime frequency in drop_time_index * Make sure indices for residuals match

Update release notes

2450dea

Only have indices match if dfs in play

9a45ab9

Extend test to check for predefined features case

cdcb561

Pinning sktime temporarily to see if tests work

0857b38

eccabay marked this pull request as ready for review April 7, 2023 13:34

auto-assign bot assigned eccabay Apr 7, 2023

eccabay requested review from christopherbunn, jeremyliweishih, tamargrey and chukarsten April 7, 2023 13:35

eccabay and others added 3 commits April 7, 2023 10:19

Unpin sktime

25c1873

Merge branch 'main' into TML-7064_naieve_PI

707596c

Merge branch 'TML-7064_naieve_PI' of github.com:alteryx/evalml into T…

9706254

…ML-7064_naieve_PI

jeremyliweishih requested changes Apr 7, 2023

View reviewed changes

eccabay added 2 commits April 7, 2023 15:41

Update pi test to check interval assumptions

7bc63f8

Simplify predict_in_sample index checking

fe6369b

eccabay requested a review from jeremyliweishih April 7, 2023 20:01

jeremyliweishih approved these changes Apr 7, 2023

View reviewed changes

chukarsten suggested changes Apr 7, 2023

View reviewed changes

Rename variables and add test comments for clarity

eb86336

eccabay requested a review from chukarsten April 10, 2023 13:39

chukarsten approved these changes Apr 10, 2023

View reviewed changes

chukarsten merged commit 401bf9a into main Apr 10, 2023

chukarsten deleted the TML-7064_naieve_PI branch April 10, 2023 17:35

chukarsten mentioned this pull request Apr 10, 2023

Release v0.73.0. #4122

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Naive prediction intervals #4127

Naive prediction intervals #4127

eccabay commented Apr 6, 2023

codecov bot commented Apr 6, 2023 •

edited

Loading

jeremyliweishih left a comment

jeremyliweishih Apr 7, 2023

chukarsten Apr 7, 2023

eccabay Apr 7, 2023

eccabay Apr 7, 2023

jeremyliweishih Apr 7, 2023

jeremyliweishih Apr 7, 2023

eccabay Apr 7, 2023

jeremyliweishih Apr 7, 2023

chukarsten Apr 7, 2023

eccabay Apr 7, 2023

jeremyliweishih left a comment

chukarsten left a comment

chukarsten Apr 7, 2023

eccabay Apr 10, 2023

Naive prediction intervals #4127

Naive prediction intervals #4127

Conversation

eccabay commented Apr 6, 2023

codecov bot commented Apr 6, 2023 • edited Loading

Codecov Report

jeremyliweishih left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jeremyliweishih left a comment

Choose a reason for hiding this comment

chukarsten left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

codecov bot commented Apr 6, 2023 •

edited

Loading