Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Creating a new dataset #3

Open
scientist1642 opened this issue Feb 9, 2017 · 3 comments
Open

Creating a new dataset #3

scientist1642 opened this issue Feb 9, 2017 · 3 comments

Comments

@scientist1642
Copy link

Hi, I'm trying to create a new unlabeled dataset and had some questions:
https://github.com/CuriousAI/tagger/blob/master/data/shapes.py#L110
As I see masks are only used to calculate AMI score and doesn't take part in training process, right?
What about the "codes", what is it used for? in shapes it's empty and in Freq20-MNIST it seems related to textures.

@Qwlouse
Copy link

Qwlouse commented Feb 9, 2017

Correct. At some point we've used codes to see how well we could classify textures in the image. It is also not used for training. So just using arrays of zeros should work fine.

@scientist1642
Copy link
Author

Hi @Qwlouse, I am wondering how to choose the rough network size in order to not over/underfit.
For example, does shapes dataset really need (2000, 1000, 500) ladder?

@Qwlouse
Copy link

Qwlouse commented May 11, 2017

We haven't seen a case of overfitting yet. However for some reason the shapes network needs to be rather big. You can run with fewer units, but performance does degrade.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants