Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

how to configure number of clustering heads T, if clustering is priority #12

Closed
ksulima opened this issue Sep 1, 2020 · 3 comments
Closed

Comments

@ksulima
Copy link

ksulima commented Sep 1, 2020

Hi,

You say in a paper 3.5 Since our main objective is to use clustering to learn a good data representation Φ, we consider a multi-task setting in which the same representation is shared among several different clustering tasks, which can potentially capture different and complementary clustering axis

What if my main objective is to receive meaningful clusters. I would like to train model on my own unlabeled, rather messy dataset and then clean dataset from particular images filtering out some clusters.
Would you recommend to set number of heads T =1 in that case?

@yukimasano
Copy link
Owner

Hey!
If the main objective is to cluster images (as opposed to learning representations), you could try something I mentioned here: #11 (i.e. start with a pretrained model and only reset the last layer). And yes, probably starting with one head makes most sense.

Later you can also play around with replacing the head (currently a linear layer) with a MLP and see if it further improves / makes more diverse heads.

@yukimasano
Copy link
Owner

Good luck!

@ksulima
Copy link
Author

ksulima commented Sep 2, 2020

Thanks for a quick response. I will follow your advice.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants