Panel Data Vignette (Issue 99) #115

mgyliu · 2022-08-03T18:47:37Z

Addresses issue #99

Changes:

New dataset for statcan employment panel data
Vignette demonstrating:
- Cleaning/putting data in epi_df format
- Using the non-epi epi_df with epi_recipe and epi_workflow
- Predicting with canned forecasters

R/data.R

dajmcdon · 2022-08-17T17:48:49Z

I gave this a quick once-over. I think the idea is great.

Major comments:

For the panel data model you built, can you be more specific/didactic about exactly the model you're fitting? It's perfectly safe to use math. Maybe also do an alternative model with a few more complexities?
After fitting, can you illustrate the standard model investigations one would undertake for a linear model? Examine fitted/observed values. Look at the coefficients and their CIs, etc. Stuff you'd do in an undergrad linear models class.

Minor:

There are a few typos I noticed. We can clean up later. The only important one is I think you used lag=c(0,1,1) instead of lag=0:2.

mgyliu · 2022-09-15T09:45:58Z

Thanks for the feedback & sorry this took so long - could you take another look @dajmcdon?

dajmcdon · 2024-02-06T18:31:53Z

Blocked by #291

rachlobay

This looks pretty great. A couple of minor things:

Suppress messages & warnings for the first code chunk (else we get a bunch of attach package messages, at least when I knit the .Rmd)
I made a couple of very minor changes to the sentences & fixed a few typos
The size of the model diagnostics plot appears too large/elongated when I knitted the file & look at the output? Maybe that’s just an issue on my computer, but I’ve resized it in a simpler way, just in case that appears like that for others as well.

I’ve fixed these minor changes & pushed the updates to Github. I have not fixed the following more important things yet because I think that they require your input...

Important things for you to weigh in on (presented by section):

Model fitting and prediction:

The intercept and other values discussed below the model output seem wrong (unless they are transformed or something?). Also, see the sentence on what coefficients are significantly greater than zero.
Is that correct to say…”lags at 2 years and 3 years ago have coefficients significantly greater than zero.” Because isn’t the maximum lag used correspond to 2 years ago? The current way of talking about lags also comes up in the section titled Model fitting & postprocessing.

Autoregressive model with exogenous inputs:

The model form seems wrong (currently shows interaction term, shouldn’t it be +)?

Model fitting & postprocessing:

I think that the model processing steps should be introduced and enumerated in the order they are performed (else it gets confusing to have the current out-of-order presentation).
I don’t see that the conclusions on significance are correct? Because, from a quick inspection, I thought that typically in R model output, one asterisk typically means “p < . 05”. If yeah, then there’s more/different terms that are significant than what’s currently indicated in the discussion of the model summary,

Flatline forecaster:

Is the flatline forecaster model form correct? I am not convinced that the alphas should be there if we are still just propagating a value ahead… If that’s right, then I think the model form should be as it is in https://cmu-delphi.github.io/cste-forecast-workshop-2023/#/canned-forecasters-that-work-out-of-the-box.-2.

Overall thoughts:

Related to my last point about the flatline forecaster presentation, I don’t see any location indices used anywhere or hats to indicate predictions? If we really want to be precise & consistent with other vignettes, presentations and tooling book chapters, it is probably good to include those. Refer to the previous link on the presentation you guys gave for an example of this.
There seems to be a general lack of plots… Currently, I see a line plot at the beginning when exploring the data and a model diagnostics plot. It may be nice to show how to look at/use the predictions for panel data instead of just showing the reader the code.
Related to my previous point: The vignette ends a little abruptly for me… Perhaps we should add a short summary at the end or a sentence or two after the final code block to suggest something for the reader to try on their own. Or we could expand the last section a bit (ex. add good plot to display for these results & discuss them briefly? A reader may find that useful and a nice way to end things).

I may be wrong about these points, but I think they are worth briefly talking about before merging.

…keys`

dajmcdon · 2024-04-26T18:23:43Z

@rachlobay All excellent points. I think I've hit them all. Would you mind giving it a quick once-over if you have a chance?

rachlobay · 2024-04-27T03:16:07Z

Looks great! I'll merge now

mgyliu force-pushed the ml-99-panel-data-vignette branch 2 times, most recently from bda4ea5 to fb030b5 Compare August 12, 2022 00:10

mgyliu changed the title ~~[WIP] [Issues 99 & 114] Panel Data Vignette & epi_keys_mold fix~~ [WIP] [Issues 99] Panel Data Vignette Aug 12, 2022

mgyliu added 5 commits August 15, 2022 16:16

wip panel data

027296e

update epi_keys_mold and tests to handle additional keys

6673d6c

put back files from rebase

a4b0dd7

fix data format

bd47fa8

wip vignette

0b797b0

mgyliu force-pushed the ml-99-panel-data-vignette branch from 1fd96bb to 0b797b0 Compare August 15, 2022 23:16

mgyliu added 5 commits August 15, 2022 21:12

updates to vignette

1683240

use a better dataset

3864d7d

use better data pt 2

c781d33

fix doc formatting

34ecc96

wording changes

2f1631d

mgyliu changed the title ~~[WIP] [Issues 99] Panel Data Vignette~~ Panel Data Vignette (Issue 99) Aug 17, 2022

mgyliu commented Aug 17, 2022

View reviewed changes

R/data.R Outdated Show resolved Hide resolved

mgyliu marked this pull request as ready for review August 17, 2022 16:56

mgyliu requested a review from dajmcdon as a code owner August 17, 2022 16:56

mgyliu force-pushed the ml-99-panel-data-vignette branch 2 times, most recently from 615cde4 to e6f62ec Compare September 15, 2022 09:04

mgyliu added 8 commits September 17, 2022 21:19

wip panel data

325f8a2

update epi_keys_mold and tests to handle additional keys

29d567c

put back files from rebase

b30b838

fix data format

340fe12

wip vignette

Unverified

This commit is not signed, but one or more authors requires that any commit attributed to them is signed.

Learn about vigilant mode

25674a5

updates to vignette

27f1387

use a better dataset

Unverified

This commit is not signed, but one or more authors requires that any commit attributed to them is signed.

Learn about vigilant mode

2a2c903

use better data pt 2

Unverified

This commit is not signed, but one or more authors requires that any commit attributed to them is signed.

Learn about vigilant mode

408e7c6

dajmcdon added 2 commits February 3, 2024 09:24

Merge branch 'dev' into ml-99-panel-data-vignette

ce40c50

ignore vignette caches

ebb60ad

dajmcdon mentioned this pull request Feb 3, 2024

Date processing mismatch #291

Closed

dajmcdon added 3 commits February 3, 2024 10:30

merge dev and minor revisions

009abf5

bug: blocked by #291

4871589

done. forecast_date/target_date processing is bolixed by #291

8552e1a

dajmcdon added 2 commits March 8, 2024 08:49

Merge branch 'dev' into ml-99-panel-data-vignette

Loading
Loading status checks…

a010db0

some simplifications

Loading
Loading status checks…

44a145f

dajmcdon mentioned this pull request Mar 18, 2024

291 date period #297

Merged

4 tasks

dajmcdon added 6 commits April 9, 2024 14:45

Merge branch 'dev' into ml-99-panel-data-vignette

943e180

move all vignette data to here vignettes/

Loading
Loading status checks…

4140ee1

export grad_employ_subset, redocument

12745fb

fix vignette to match

4296c16

checks pass

Loading
Loading status checks…

4dd2b1c

style and fix pkgdown

Loading
Loading status checks…

ec775a0

dajmcdon requested a review from rachlobay April 9, 2024 22:45

dajmcdon approved these changes Apr 9, 2024

View reviewed changes

dajmcdon mentioned this pull request Apr 12, 2024

Release version 0.1.0 / 1.0.0 #318

Open

18 tasks

rachlobay added 2 commits April 24, 2024 07:07

Minor fixes

e3dc215

styler

Loading
Loading status checks…

1ee300b

rachlobay reviewed Apr 24, 2024

View reviewed changes

dajmcdon added 2 commits April 26, 2024 11:15

address @rachellobay review, adjust canned printing to handle `other_…

cc988cb

…keys`

add a conclusion

Loading
Loading status checks…

55e8166

dajmcdon requested a review from rachlobay April 26, 2024 18:24

rachlobay merged commit 99c30c6 into dev Apr 27, 2024
3 checks passed

dajmcdon linked an issue May 23, 2024 that may be closed by this pull request

Vignette - panel data demos #99

Closed

dajmcdon deleted the ml-99-panel-data-vignette branch September 20, 2024 21:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Panel Data Vignette (Issue 99) #115

Panel Data Vignette (Issue 99) #115

mgyliu commented Aug 3, 2022 •

edited

Loading

dajmcdon commented Aug 17, 2022

mgyliu commented Sep 15, 2022

dajmcdon commented Feb 6, 2024

rachlobay left a comment

dajmcdon commented Apr 26, 2024

rachlobay commented Apr 27, 2024

Panel Data Vignette (Issue 99) #115

Panel Data Vignette (Issue 99) #115

Conversation

mgyliu commented Aug 3, 2022 • edited Loading

dajmcdon commented Aug 17, 2022

mgyliu commented Sep 15, 2022

dajmcdon commented Feb 6, 2024

rachlobay left a comment

Choose a reason for hiding this comment

dajmcdon commented Apr 26, 2024

rachlobay commented Apr 27, 2024

mgyliu commented Aug 3, 2022 •

edited

Loading