Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

nflverse_download() should provide an argument to reload specific data from a release #168

Closed
mrcaseb opened this issue Oct 31, 2022 · 2 comments

Comments

@mrcaseb
Copy link
Member

mrcaseb commented Oct 31, 2022

We have implemented the use_hive argument to provide an easy way to use arrow::open_dataset, esp. for pbp data.

If someone wants to actually use this in-season, we have to implement an argument that allows to download the most recent season only (like the force_rebuild argument in nflfastR::update_db()). Otherwise there is no easy way to update current season pbp without loading everything else as well.

I have described the force_rebuild argument as follows:

grafik

@mrcaseb
Copy link
Member Author

mrcaseb commented Oct 31, 2022

So what I want is to make this code

nflreadr::nflverse_download(
  pbp, 
  folder_path = "some_path", 
  file_type = "parquet",
  use_hive = TRUE,
  reload = 2022
)

to only download 2022 pbp into the correct folder.

@mrcaseb
Copy link
Member Author

mrcaseb commented Oct 31, 2022

I am an idiot. A timestamp check is included. The function will only download if something has changed

@mrcaseb mrcaseb closed this as completed Oct 31, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant