Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consider partition data reduction algorithm #66

Open
malcolmbarrett opened this issue Jul 27, 2022 · 3 comments
Open

Consider partition data reduction algorithm #66

malcolmbarrett opened this issue Jul 27, 2022 · 3 comments
Labels
New Engine 🚗 Add new engine

Comments

@malcolmbarrett
Copy link

the Partition algorithm has several nice properties, particularly in high dimensions. It's also completely reproducible without a seed. https://github.com/USCbiostats/partition

@EmilHvitfeldt EmilHvitfeldt added the New Engine 🚗 Add new engine label Aug 30, 2022
@EmilHvitfeldt
Copy link
Member

Correct me if I'm wrong, but isn't this method closer to a dimensionality reduction method?

@malcolmbarrett
Copy link
Author

Now that you mention it, the clustering is really happening at the column level, not the row level like this package is focused on. Maybe better suited to a recipe somewhere?

@EmilHvitfeldt
Copy link
Member

If a trained application can be reapplied to new data, then it would be perfect as a recipe step!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
New Engine 🚗 Add new engine
Projects
None yet
Development

No branches or pull requests

2 participants