Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

It would be super cool if you could make a step by step tutorial on how to train the model on new datasets. #19

Closed
Christoph-Schuhmann opened this issue Aug 5, 2020 · 2 comments

Comments

@Christoph-Schuhmann
Copy link

Christoph-Schuhmann commented Aug 5, 2020

It would be super cool if you could make a step by step tutorial on how to train the model on new datasets. :)

I would like to use your technique to sort unlabled images into different domains, which then could be used to train GANs to create images of these domains.

@wvangansbeke
Copy link
Owner

Hi @Christoph-Schuhmann,

Thank you for your interest. Sounds like an interesting idea!

Listing the most important steps off the top of my head here:

  • To train on new datasets, you need to adapt your dataloader to the format we use in this repo. Have a look at the data folder and make sure your __getitem__ method is similar to ours.
  • Don't forget to add the path to your dataset in utils/mypath.py
  • Make config files (similar to the ones in the configs folder) for the pretext and clustering steps. I highly recommend using them.
  • Follow the steps in the readme

The same question was asked in issue #8, but if I have more time later I can go into more detail.

Hope this helps for now.

@wvangansbeke
Copy link
Owner

If something is still unclear, feel free to ask. I will close this issue for now.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants