Ar forecasting example #452

NathanielF · 2022-10-21T21:45:21Z

Forecasting AR Structural Timeseries models

Adding this merge request here for the open issue: #450
The notebook has the broad structure of the related blog post, and I'm happy to take any feedback or suggestions on streamlining it. I believe it follows the Jupyter style advice accurately.

It seems to be passing the pre-commit checks locally. I found it a bit frustrating to get jupytext to pass.

Notebook follows style guide https://docs.pymc.io/en/latest/contributing/jupyter_style.html
PR description contains a link to the relevant issue: a tracker one for existing notebooks or a proposal one for new notebooks
Check the notebook is not excluded from any pre-commit check: https://github.com/pymc-devs/pymc-examples/blob/main/.pre-commit-config.yaml

Helpful links

https://github.com/pymc-devs/pymc-examples/blob/main/CONTRIBUTING.md

review-notebook-app · 2022-10-21T21:45:25Z

Check out this pull request on ReviewNB to see visual diffs and provide feedback on Jupyter Notebooks.

drbenvincent · 2022-10-21T22:16:05Z

Thanks so much for the contribution so far!

Just set the remote checks running. Will try to review over the weekend, but might drift into early next week.

NathanielF · 2022-10-22T05:54:48Z

Thanks. Will have a look at that failing check this evening. Think it is failing on the codespell checks which were skipped for some reason when I ran it locally.

NathanielF · 2022-10-22T07:47:56Z

Tried to address the failing codespell check. Seemed to work but not entirely sure.

lucianopaz

@NathanielF, first of all, thank you for submitting such an awesome example. It touches on a subject that has been completely neglected in the past pymc documentation.
However, there is a crucial subtlety about time series forecasting that was missed in your notebook: the future AR values are conditionally dependent on the past AR values that were learnt. With your approach of setting the data of the AR, you are resampling the past AR values rather than drawing them from their learnt posterior. The resampled AR will still be conditioned on the learnt coefficients, but the exact past AR values that are in the posterior will be ignored. I mentioned a way to work around this problem here, and maybe @ricardoV94 can share a gist he wrote that concatenates two random variables to do predictions without resampling what was learnt for the past values.
I would be very happy to help you out in fixing the forecasting, so let me know if my comments were clear or if you need more tips to get this to work.
As I said before and in one of my code comments, this has been lacking in the pymc documentation for a long time, and your contribution is a perfect opportunity to make things right!
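To make the conditioning point concrete, here is a minimal NumPy sketch, not the notebook's PyMC code. It assumes an AR(1) process and uses made-up numbers in place of real posterior draws; the names `forecast_ar1`, `ar_last`, `rho` and `sigma` are hypothetical. It contrasts rolling the process forward from the learnt last latent value with restarting it from a fresh initial draw.

```python
import numpy as np

rng = np.random.default_rng(0)


def forecast_ar1(start_value, rho, sigma, n_steps, rng):
    """Roll an AR(1) process forward, one path per posterior draw.

    start_value, rho, sigma: arrays of shape (n_draws,).
    Returns an array of shape (n_draws, n_steps).
    """
    n_draws = start_value.shape[0]
    path = np.empty((n_draws, n_steps))
    prev = start_value
    for t in range(n_steps):
        # Each future value depends on the previous value of the same draw
        prev = rho * prev + rng.normal(0.0, sigma, size=n_draws)
        path[:, t] = prev
    return path


# Made-up stand-ins for posterior draws from a fitted model (assumption)
n_draws = 2000
rho = rng.normal(0.9, 0.02, size=n_draws)
sigma = np.abs(rng.normal(0.1, 0.01, size=n_draws))
ar_last = rng.normal(5.0, 0.05, size=n_draws)  # learnt latent AR value at the last training step

# Forecast conditioned on the learnt past values
conditional = forecast_ar1(ar_last, rho, sigma, n_steps=10, rng=rng)

# Forecast that ignores the learnt past and restarts from a fresh init draw
fresh_init = rng.normal(0.0, 1.0, size=n_draws)
unconditional = forecast_ar1(fresh_init, rho, sigma, n_steps=10, rng=rng)

print("first-step mean, conditional:  ", conditional[:, 0].mean())
print("first-step mean, unconditional:", unconditional[:, 0].mean())
```

Both forecasts use the same coefficient draws, but only the first one continues from where the training period ended, which is the behaviour the review comment asks for.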

myst_nbs/time_series/Forecasting_with_structural_timeseries.myst.md

lucianopaz · 2022-10-22T08:17:39Z

+```{code-cell} ipython3
+az.plot_trace(idata_ar, figsize=(10, 6), kind="rank_vlines");
+```
+


I think that it would be valuable to also plot the learnt latent AR variable over time here. You can do something like:

idata_ar.posterior.ar.mean(["chain", "draw"]).plot()

or also take advantage of arviz.hdi
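As a rough illustration of that kind of summary, the sketch below works on a synthetic NumPy array shaped like `(chain, draw, time)` rather than on the real `idata_ar` object. It uses an equal-tailed percentile interval as a simple stand-in for `arviz.hdi`, so the band is an assumption about what one might plot and is not an HDI.

```python
import numpy as np

rng = np.random.default_rng(1)

# Synthetic stand-in for idata_ar.posterior.ar with dims (chain, draw, time)
n_chains, n_draws, n_time = 4, 500, 50
steps = rng.normal(0.0, 0.1, size=(n_chains, n_draws, n_time))
ar_draws = np.cumsum(steps, axis=-1)

# Posterior mean of the latent series at each time point
ar_mean = ar_draws.mean(axis=(0, 1))

# Equal-tailed 94% band per time point (not an HDI, just a simple substitute)
lower, upper = np.percentile(ar_draws, [3, 97], axis=(0, 1))

print(ar_mean.shape, lower.shape, upper.shape)
```

The three arrays share the time axis, so they can be passed to a line plot and a shaded band against the same x values.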

I've added a plot here, but were you thinking I should hit any notes of exposition as well?

I think that it looks nice enough not to require more explanation. Maybe @drbenvincent agrees?

NathanielF · 2022-10-25T09:39:05Z

Ooops, I didn't mean to remove the request for review from @drbenvincent.

Also, didn't mean to rush you @lucianopaz. Just wondering if the "request re-review" button is the right etiquette, or would you already have seen the changes I've made since you requested them? Don't mean to put pressure on, just meant to signal that I think I've addressed the above.

lucianopaz

This looks way better now! I've added a bunch of comments. After you iterate through those, I think that the only things that might be left are a bunch of minor stylistic best practices, and potentially also run black and isort on the notebook to have it finally ready.

NathanielF · 2022-10-27T10:58:31Z

Thanks so much for your feedback on this @lucianopaz. I will adapt the notebook as discussed above and review it for grammar and tone. Hopefully I'll push some changes later today.

NathanielF · 2022-10-28T13:46:42Z

Ah, sorry @drbenvincent, I did the same thing again. No idea why it knocked you out when I requested a review.

NathanielF · 2022-11-07T09:47:50Z

Just giving this a slight nudge @lucianopaz , @drbenvincent. Think it's nearly there and it'd be cool if we could get it over the line this week?

drbenvincent

Hi @NathanielF. Sorry about the delay on the review. I've been bogged down with client work.

I think this is excellent and will make a great addition.

I've added a bunch of comments. Feel free to ask if any clarification is needed.

PS. After these changes I'm happy to approve. That said, I'm not a time series expert, but as long as @lucianopaz is happy then I'm confident we can approve this soon.

NathanielF · 2022-11-08T12:55:45Z

Thanks again for your time on this @drbenvincent, I know we're all busy. Happy to wait for @lucianopaz to give the above another look.

I think I've addressed all your comments and his prior comments. In the last commit I've made the model diagram neater for all models. I've also stressed that we're adding structural components not for arbitrary reasons but because real-world data tends to have multiple influences, and Bayesian structural time-series modelling is one way of capturing these multi-aspected data generating processes.

For @lucianopaz's comments above, the main change was to revert to using the coefs pattern rather than having priors for two individual coefficients separately.

drbenvincent · 2022-11-08T13:10:32Z

Just saw this in a quick look now...
In the very final plot, the predicted mean seems not matched up with the shaded regions. Maybe worth a double check.

NathanielF · 2022-11-08T13:33:28Z

@drbenvincent do you mean that at the tail end of the prediction it seems to come out of phase? Or just the degree of oscillation? If I change the prior of the beta_fourier terms I get slightly more pronounced oscillation:

I wasn't too concerned about it. The point of the plot was to just show that we can recover the seasonality pattern. I think it does that...

I think it's too much of a rabbit hole to go down, to try and figure out if the percentile color-gradient technique is getting the color map banding exactly right. To my mind the color mapping is there just to suggest that the probabilistic outcomes come with a graded range of plausibility...

drbenvincent · 2022-11-08T13:35:18Z

It was the phase that I noticed. Thought I'd mention it in case of anything obvious.

NathanielF · 2022-11-08T13:37:44Z

Right, yeah... honestly not sure why that is happening.

NathanielF · 2022-11-08T15:19:16Z

I added more samples to the plot, extended the prediction period, and allowed a wider sigma on the Fourier terms. There is still slight phasing visible, but it doesn't seem to be a preface to doom. I don't think it's anything to worry about.

lucianopaz

@NathanielF, thanks again for all of your amazing work! And thanks for the nudge, I had completely lost track of this PR...
I've reviewed it a bit deeper to find out why the forecast plots showed strange behavior. I found 3 issues:

1. The AR distribution you use for forecasting starts from the last time point of the training time. But then you combine it with trend and seasonality that are one time step into the future with respect to it. To fix this, I added an extra coordinate to the model that adds an extra step to the AR so that it has the last time step (used for the init), and all of the time points into the future that must be forecasted.
2. The Fourier features for the forecast with seasonality were not starting from the last time step, but from 0. This added a phase difference that appeared as a sort of discontinuity/inflection between the forecast and the training period.
3. The plots for the cyan lines of the forecast were using the correct future observed time steps, but the shaded areas were using a linspace, so they didn't match. Visually, this appeared like a sort of phase lag between the cyan line and the shaded areas.
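The phase problem with the Fourier features can be reproduced in a few lines. The sketch below is an illustration in plain NumPy and is not the notebook's code: `fourier_features`, the period and the series lengths are all hypothetical. It shows that continuing the time index past the training period keeps the seasonal features continuous, while restarting the index at 0 shifts their phase.

```python
import numpy as np


def fourier_features(t, period, n_order):
    """Sine and cosine features at time index t, shape (len(t), 2 * n_order)."""
    t = np.asarray(t, dtype=float)
    k = np.arange(1, n_order + 1)
    angles = 2.0 * np.pi * np.outer(t, k) / period
    return np.concatenate([np.sin(angles), np.cos(angles)], axis=1)


# Hypothetical sizes, chosen so the training length is not a multiple of the period
n_train, n_forecast, period = 50, 12, 12

t_train = np.arange(n_train)
t_future = np.arange(n_train, n_train + n_forecast)  # continues where training stopped
t_restart = np.arange(n_forecast)                     # restarts at 0, the problem case

f_train = fourier_features(t_train, period, n_order=2)
f_future = fourier_features(t_future, period, n_order=2)
f_restart = fourier_features(t_restart, period, n_order=2)

# Reference: features computed in one pass over training plus forecast
f_full = fourier_features(np.arange(n_train + n_forecast), period, n_order=2)

continuous = np.allclose(np.vstack([f_train, f_future]), f_full)
shifted = np.allclose(np.vstack([f_train, f_restart]), f_full)
print("continued index matches full series:", continuous)
print("restarted index matches full series:", shifted)

# Reuse the same forecast index for both the mean line and the shaded band
x_forecast = t_future
```

Keeping a single `x_forecast` array for every forecast layer in the plot also avoids the mismatch described in the third point, where the line and the shaded areas were drawn against different x values.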

After addressing these issues locally, the last plot looks like this:

NathanielF · 2022-11-09T10:19:18Z

These are great observations @lucianopaz! Thanks for digging into it. I'll adjust these and re-push today.

NathanielF · 2022-11-09T11:54:30Z

Thanks so much again @lucianopaz, that last observation about the plotting function was subtle. I've been looking at the function too long to have seen it! I've adjusted the notebook in the manner you suggested and was indeed able to recover a prediction plot without the phasing issues:

I added a small note about the AR logic in the comments to the code:

Do you think this is sufficient, or should I add anything else?

lucianopaz

Thanks @NathanielF. It's almost ready. I just have two very minor nitpicks before approving

NathanielF · 2022-11-09T14:36:54Z

Thanks @lucianopaz, I fixed those last two issues. It was really great working on this issue with yourself and @drbenvincent. I learned a tonne!

lucianopaz

Great work @NathanielF! Thank you so much for contributing this!

NathanielF · 2022-11-09T15:00:55Z

Fantastic. This has been a great experience. I hope to follow it up shortly with another pull request on the Bayesian VAR models. Thanks again for your time on this!

NathanielF added 3 commits October 21, 2022 20:44

[Forecasting with AR models pymc-devs#450] first draft AR forecasting…

e212fa0

… notebook Signed-off-by: Nathaniel <NathanielF@users.noreply.github.com>

[Forecasting with AR models pymc-devs#450] adding myst notebook

9b3c4cf

Signed-off-by: Nathaniel <NathanielF@users.noreply.github.com>

[Forecasting with AR models pymc-devs#450] passing pre-commit checks

bebb8ad

Signed-off-by: Nathaniel <NathanielF@users.noreply.github.com>

drbenvincent requested review from drbenvincent and lucianopaz October 21, 2022 22:12

[Forecasting with AR models pymc-devs#450] codespell check

794fbc4

Signed-off-by: Nathaniel <NathanielF@users.noreply.github.com>

lucianopaz requested changes Oct 22, 2022

NathanielF and others added 6 commits October 22, 2022 17:36

Update myst_nbs/time_series/Forecasting_with_structural_timeseries.my…

d2708c4

…st.md Co-authored-by: Luciano Paz <luciano.paz.neuro@gmail.com>

Update myst_nbs/time_series/Forecasting_with_structural_timeseries.my…

b3cc3c1

…st.md Co-authored-by: Luciano Paz <luciano.paz.neuro@gmail.com>

Update myst_nbs/time_series/Forecasting_with_structural_timeseries.my…

32a68ed

…st.md Co-authored-by: Luciano Paz <luciano.paz.neuro@gmail.com>

[Forecasting with AR models pymc-devs#450] changed AR predict process…

c6127c7

… to be conditional on learned posterior Signed-off-by: Nathaniel <NathanielF@users.noreply.github.com>

[Forecasting with AR models pymc-devs#450] tidying text introducting …

14f3da3

…the predict step pattern Signed-off-by: Nathaniel <NathanielF@users.noreply.github.com>

[Forecasting with AR models pymc-devs#450] remove some typos and add …

1e605f4

…some more text Signed-off-by: Nathaniel <NathanielF@users.noreply.github.com>

NathanielF requested review from lucianopaz and removed request for drbenvincent October 25, 2022 09:35

drbenvincent self-requested a review October 25, 2022 10:52

lucianopaz requested changes Oct 27, 2022

[Forecasting with AR models pymc-devs#450] reverted coefs pattern in …

5e8f2ec

…model and improved plot labels. Signed-off-by: Nathaniel <NathanielF@users.noreply.github.com>

NathanielF requested review from lucianopaz and removed request for drbenvincent October 28, 2022 13:44

drbenvincent self-requested a review November 8, 2022 09:56

drbenvincent reviewed Nov 8, 2022

NathanielF added 2 commits November 8, 2022 12:26

updated with Ben's comments

7bf2103

Signed-off-by: Nathaniel <NathanielF@users.noreply.github.com>

passing checks

867f14d

Signed-off-by: Nathaniel <NathanielF@users.noreply.github.com>

lucianopaz requested changes Nov 9, 2022

NathanielF and others added 7 commits November 9, 2022 10:24

Update myst_nbs/time_series/Forecasting_with_structural_timeseries.my…

2afcccc

…st.md Co-authored-by: Luciano Paz <luciano.paz.neuro@gmail.com>

Update myst_nbs/time_series/Forecasting_with_structural_timeseries.my…

8d8de51

…st.md Co-authored-by: Luciano Paz <luciano.paz.neuro@gmail.com>

Update myst_nbs/time_series/Forecasting_with_structural_timeseries.my…

0407df1

…st.md Co-authored-by: Luciano Paz <luciano.paz.neuro@gmail.com>

Update myst_nbs/time_series/Forecasting_with_structural_timeseries.my…

e91c592

…st.md Co-authored-by: Luciano Paz <luciano.paz.neuro@gmail.com>

Update myst_nbs/time_series/Forecasting_with_structural_timeseries.my…

ccc331e

…st.md Co-authored-by: Luciano Paz <luciano.paz.neuro@gmail.com>

Update myst_nbs/time_series/Forecasting_with_structural_timeseries.my…

2b10826

…st.md Co-authored-by: Luciano Paz <luciano.paz.neuro@gmail.com>

[Forecasting with AR models pymc-devs#450] fixed final prediction plo…

60bc521

…t function and adjusted prediction step AR logic Signed-off-by: Nathaniel <NathanielF@users.noreply.github.com>

lucianopaz reviewed Nov 9, 2022

[Forecasting with AR models pymc-devs#450] corrected mistaken median …

dd5f186

…prediction for mean and removed redundant argument from plot_fits function Signed-off-by: Nathaniel <NathanielF@users.noreply.github.com>

lucianopaz approved these changes Nov 9, 2022

lucianopaz merged commit d6fcd3d into pymc-devs:main Nov 9, 2022
