-
Notifications
You must be signed in to change notification settings - Fork 123
Closed
Labels
tidy-dev-day 🤓Tidyverse Developer Day rstd.io/tidy-dev-dayTidyverse Developer Day rstd.io/tidy-dev-day
Description
Would like if threshold argument within step_other() could be specified by an integer (rather than just by proportion), or had a separate n argument to specify the minimum sample size before it is collapsed into the "other" category.
E.g. to specify minimum sample size of 30:
recipe(price ~ clarity + color + carat, data = diamonds) %>%
step_other(all_nominal(), n = 30)I'm curious what the best way of doing this is currently? (Link to hack for specifying consistent sample size to step_other across datasets of different sizes.)
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
tidy-dev-day 🤓Tidyverse Developer Day rstd.io/tidy-dev-dayTidyverse Developer Day rstd.io/tidy-dev-day