Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

net output dimension mismatch on cifar10 #5

Closed
lucaskyle opened this issue Dec 9, 2019 · 7 comments
Closed

net output dimension mismatch on cifar10 #5

lucaskyle opened this issue Dec 9, 2019 · 7 comments

Comments

@lucaskyle
Copy link

hey there:

when i run python train_CIFAR-10.py

Batch_size=128
so the output of network should be (128,10)
i got (512,10)

ValueError: Expected input batch_size (512) to match target batch_size (128).

@lucaskyle
Copy link
Author

should i modify the Network?

@lucaskyle
Copy link
Author

ok i checked official magvii code
seems like u r working on training supernet on cifar10?

@lucaskyle
Copy link
Author

the gap_size should be 2

@ShunLu91
Copy link
Owner

ShunLu91 commented Dec 10, 2019

Thanks! I will fix these bugs.

@lucaskyle
Copy link
Author

lucaskyle commented Dec 10, 2019

this is so strange
the train loss is still Loss: 2.303
it didnt converge

i try other models that the train loss reduced easily...

maybe sth wrong in the supernetwork.
i will check magvii codes and their supermodel.

@ShunLu91
Copy link
Owner

Sorry, there are some bugs in the model.py and it has been fixed now. You can refer to the latest version of the code.

@lucaskyle
Copy link
Author

its the learning rate
seems like u fixed it.

thx

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants