You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This was previously discussed in tidymodels/recipes#399; I wasn't sold on what the poster wanted to return and they added a step function to their own package.
You might also want to take a look at tidymodels/planning#12. For non-preprocessing needs, I think that @kbodwin's thoughts are spot-on.
It would be good as long as the output is a factor variable that denotes the cluster that the sample belongs to. For new data, we can use a nearest centroid (or mediod) approach to assign new samples (but this is dependent on the clustering method).
I'd support this but don't have the time to do it; you'd have to start a PR.
This issue has been automatically locked. If you believe you have found a related problem, please file a new issue (with a reprex: https://reprex.tidyverse.org) and link to this issue.
Love
embed
. It would be super awesome if there was astep_kmeans()
orstep_cluster()
that added cluster assignments to a data frame.Why?
Cluster assignments are super important for segmentation. K-Means and similar algorithms (e.g. K-modes) can help us to identify customer groups.
Embed
Embed is a good spot for this.
step_umap()
is a similar algorithm that I often use in combination with K-Means.Let me know what you think.
Thanks, Matt
The text was updated successfully, but these errors were encountered: