Clustering #47

Open
jenniewilliams opened this issue Apr 4, 2023 · 1 comment

Comments

@jenniewilliams

I am really excited about applying these methods to my textual data, and it appears to be working really well. However, I have a question regarding the clustering: how does it decide on the number of clusters?

@count0
Contributor

count0 commented Jul 6, 2023

It selects the number of groups that minimizes the description length of the data, i.e. the number of bits needed to describe the data according to the model being fitted. More information is given in the paper: https://arxiv.org/abs/2112.00183
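
To make the idea concrete, here is a minimal sketch of description-length model selection on a toy graph. It is an illustration of the principle only, not the model or prior this project fits: the crude two-part code, the function name `description_length_bits`, and the example graph are all assumptions made for this sketch. The total cost is the bits needed to state the model (group labels and edge counts between groups) plus the bits needed to say which node pairs are actually connected given those counts.

```python
import math
from itertools import combinations


def description_length_bits(n_nodes, edges, labels):
    """Crude two-part code length (in bits) of a graph under a partition.

    Hypothetical toy code, NOT the prior used by the project:
      model cost = group label per node + edge count per group pair
      data cost  = which node pairs are connected, given those counts
    """
    groups = sorted(set(labels))
    sizes = {r: labels.count(r) for r in groups}

    # Count edges between every (unordered) pair of groups.
    edge_counts = {}
    for u, v in edges:
        key = tuple(sorted((labels[u], labels[v])))
        edge_counts[key] = edge_counts.get(key, 0) + 1

    # Model cost, part 1: one group label per node.
    bits = n_nodes * math.log2(len(groups))

    for i, r in enumerate(groups):
        for s in groups[i:]:
            if r == s:
                n_pairs = sizes[r] * (sizes[r] - 1) // 2
            else:
                n_pairs = sizes[r] * sizes[s]
            e_rs = edge_counts.get((r, s), 0)
            # Model cost, part 2: the edge count, a value in 0..n_pairs.
            bits += math.log2(n_pairs + 1)
            # Data cost: which e_rs of the n_pairs node pairs are edges.
            bits += math.log2(math.comb(n_pairs, e_rs))
    return bits


# Toy graph: two 10-node cliques joined by a single edge.
edges = (
    list(combinations(range(10), 2))
    + list(combinations(range(10, 20), 2))
    + [(9, 10)]
)

# Candidate partitions with 1, 2, 4 and 20 groups.
candidates = {
    1: [0] * 20,
    2: [0] * 10 + [1] * 10,
    4: [i // 5 for i in range(20)],
    20: list(range(20)),
}

dl = {B: description_length_bits(20, edges, labels)
      for B, labels in candidates.items()}
best_B = min(dl, key=dl.get)

for B in sorted(dl):
    print(f"B = {B:2d}: {dl[B]:.1f} bits")
print("shortest description:", best_B, "groups")
```

On this toy graph the two-group partition gives the shortest description. One group leaves the data expensive to encode, while many groups make the model itself expensive, so the minimum sits where the two costs balance. The actual method searches over partitions rather than comparing a handful of hand-picked ones.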
