-
Notifications
You must be signed in to change notification settings - Fork 124
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Detecting/identifying number of clusters present in dataset #19
Comments
You can try successive numbers of clusters and measure their performance (basically an error curve). Why not create a pull request implementing this? |
@agarie what I did a while ago was an Anova for a successive number of clusters to see how well the clusters explain the variance. |
Hi all,
|
Does AI4r have any way to let me detect the number of clusters present/inherent in a dataset (for use with a clustering algorithm), for example I would like to be able to say something like:
Best fit: 5 clusters, 99% suitable
Second best fit: 4 clusters, 88% suitable.
Currently I require a user to enter the number of clusters but this is just guess work then...
The text was updated successfully, but these errors were encountered: