Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Stratification in grouped resampling #317

Open
mikemahoney218 opened this issue Jun 28, 2022 · 1 comment
Open

Stratification in grouped resampling #317

mikemahoney218 opened this issue Jun 28, 2022 · 1 comment
Labels
discussion feature a feature request or enhancement

Comments

@mikemahoney218
Copy link
Contributor

@mikemahoney218 mikemahoney218 commented Jun 28, 2022

Feature

As part of closing #207, we've recently implemented a number of grouping functions, with group_mc_cv() (#313), group_initial_split() and group_validation_split() (#315), and group_bootstraps() (#316).

Right now, none of these functions support stratification -- which would be useful if, for instance, you had repeated measurements of a number of patients and needed to stratify by outcome. We haven't included this partially so that we could implement grouped resampling quickly, but also because we aren't exactly sure what people would expect stratification to do when resampling by groups. Specific questions include:

  • How should strata be determined when the stratification variable isn't constant within a group? Median, mode, user-provided functions? What's a good default option?
  • What rules can we use to determine when a (group x strata) needs to be pooled with others?

If anyone has any thoughts on what they'd expect stratification to do in grouping functions, let us know here!

@mikemahoney218 mikemahoney218 added feature a feature request or enhancement discussion labels Jun 28, 2022
@juliasilge
Copy link
Member

@juliasilge juliasilge commented Jun 29, 2022

Posted on RStudio Community and Twitter

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
discussion feature a feature request or enhancement
Projects
None yet
Development

No branches or pull requests

2 participants