Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove data argument from augment() signature #32

Merged
merged 1 commit into from
Nov 29, 2018
Merged

Remove data argument from augment() signature #32

merged 1 commit into from
Nov 29, 2018

Conversation

alexpghayes
Copy link
Contributor

Results in warnings for broom::augment.stl() and broom::augment.rowwise_df(), which don't have a data argument. Both of these functions are likely to be deprecated at some point so that we can add the data argument back, but this currently results in a WARNING that may or may not bother the CRAN people as I work on some quick patches for the broom 0.5.1 release.

Not sure if this would even make it to CRAN in time to matter, but I could at least tell them it's in the pipeline. Advice on whether or not this change really matters appreciated.

@alexpghayes
Copy link
Contributor Author

Note: the patches are currently off of the 0.5.1 branch of broom, i.e. https://github.com/tidymodels/broom/tree/0.5.1

@hadley
Copy link
Member

hadley commented Nov 22, 2018

What's the motivation for this? It seems like a step in the wrong direction to me, as it implies that models have to store their data.

@alexpghayes
Copy link
Contributor Author

I don't think it implies that models have to store their data, but rather that they have to store their predictions, or whatever observation level information they generate. broom so far has typically dealt with models that nicely map data from some input space to some response space, but there's no need for this to always be the case.

  • Unsupervised dimension reduction methods like t-SNE don't allow mapping new data from the input space to the new space
  • Decompositions into trend, seasonal and noise components of a time series may not apply to new data

There's a whole class of (mainly unsupervised) transformations that you can think of as being a "non-repeatable" map into some new space.

@hadley
Copy link
Member

hadley commented Nov 22, 2018

Then my sense would be that should be a separate generic. It feels to me like this need more discussion; it's a small technical change but it feels like a bit philosophical one.

@alexpghayes
Copy link
Contributor Author

alexpghayes commented Nov 25, 2018

I hear you. However, augment() has hitherto not had a data argument in a CRAN version of broom, so unless this gets removed, it will force unannounced breaking changes and deprecations in broom, broom.mixed and sparklyr.

@hadley
Copy link
Member

hadley commented Nov 26, 2018

Oh if this is just reverting a new feature that's causing problems, I don't have any objections.

@topepo topepo merged commit 2c50061 into r-lib:master Nov 29, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants