New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
more group-based splitting methods #207
Comments
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
|
I second this too. I came across a data set that has a grouping variable. I already used |
|
This would be a great idea. :) I'm trying to bootstrap with repeated measures data and want all measurements from an individual to be in the same split. It seems like this grouped |
|
Yes @rkb965 in the meantime, you can create a custom split with for some examples on how to use it. |
|
Assuming that stratification variable stratifies the groups, could something like this have any sense: |
|
We'll want to add Since |
|
Having the |
|
Related: #284 |
|
From conversation with Julia (where we talked about this in reference to spatialsample), it seems like adding |
Looks like there's a desire for |
|
@mikemahoney218 Personally, I would love to see as many of the current resampling methods as is possible gain the ability to respect the hierarchical structure while resampling. Based on a quick skim this seems like it would include: I think |
|
Howdy folks -- just to update the thread, we've implemented As for stratification of grouped resamples, we've opened a new issue (#317) to try and collect some opinions on what people would expect stratification with groups to do. If you've got thoughts, please let us know over there! |
|
@mikemahoney218 this is awesome and just what I need, thank you! One thing I noticed though, is that set.seed doesn't seem to be applied to the splits? Is it possible to have set.seed allow the group_initial_split() to separate things in the same way? Here is an example. It does seem to stay the same when repeating the process in the same session, but if you restart R studio you'll see different colors each time. Thanks |
|
This issue has been automatically locked. If you believe you have found a related problem, please file a new issue (with a reprex: https://reprex.tidyverse.org) and link to this issue. |
It would be good to have an
initial_group_split(data, group, strata, prop)method that can split the data when there are groups (perhaps patients). Thestrataoption might be difficult when the outcome (or other stratification variable) is not constant within each group. We could also use the median or mode on the stratification variable and use that.Similarly, a
mc_group_cv()function would also be a good idea (using the splitting method as above).The text was updated successfully, but these errors were encountered: