Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training on custom dataset #6

Closed
GriesserP opened this issue Apr 25, 2022 · 1 comment
Closed

Training on custom dataset #6

GriesserP opened this issue Apr 25, 2022 · 1 comment

Comments

@GriesserP
Copy link

Thank you for your super contributions,
What would be the best way to use your code to train and predict the cluster assignment of a list of embeddings representing my data points. i.e. something like .fit(X) or .fit_predict(X) from sklearn ?

@meitarronen
Copy link
Contributor

Thank you for your interest in our work!
For training, you should follow the instructions available in the readme file, and train using the DeepDPM.py file (see section Training in our readme).
In theory, you could also use this on your test set (DeepDPM is an unsupervised method, labels are not used in any training stage (we use them in our code only for evaluation, but they are not used in training).

That said, we have recently uploaded an example of using DeepDPM for inference (fit_predict). See example script under scripts/DeepDPM_load_from_checkpoint.py

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants