net output dimension mismatch on cifar10 #5

lucaskyle · 2019-12-09T07:09:55Z

Hey there,

When I run python train_CIFAR-10.py with Batch_size=128, the output of the network should be (128,10), but I got (512,10):

ValueError: Expected input batch_size (512) to match target batch_size (128).

lucaskyle · 2019-12-09T07:10:53Z

Should I modify the network?

lucaskyle · 2019-12-09T08:04:46Z

OK, I checked the official Megvii code.
It seems like you are working on training the supernet on CIFAR-10?

lucaskyle · 2019-12-09T08:46:53Z

the gap_size should be 2
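
For anyone hitting the same error, here is a rough sketch of why the batch dimension can blow up from 128 to 512. It assumes the last feature map is 2x2 for a 32x32 CIFAR-10 input and that the head flattens with something like view(-1, channels). The function name, the 1024-channel width and the "wrong" pool size of 1 are only illustrative assumptions, not taken from the repo:

```python
# Hypothetical shape bookkeeping for an average-pool + flatten head.
# All names and sizes here are assumptions for illustration only.

def rows_after_flatten(batch, channels, height, width, pool_kernel):
    """Rows produced by avg-pooling (kernel == stride) followed by
    a flatten of the form view(-1, channels)."""
    out_h = height // pool_kernel
    out_w = width // pool_kernel
    total_elements = batch * channels * out_h * out_w
    return total_elements // channels

BATCH, CHANNELS = 128, 1024   # assumed channel width of the last conv
FINAL_H = FINAL_W = 2         # assumed: 32x32 input downsampled 16x

# Pool kernel too small: the 2x2 map survives and leaks into the batch dim.
rows_bad = rows_after_flatten(BATCH, CHANNELS, FINAL_H, FINAL_W, pool_kernel=1)

# Pool kernel of 2 collapses the 2x2 map to 1x1, so the batch dim is kept.
rows_ok = rows_after_flatten(BATCH, CHANNELS, FINAL_H, FINAL_W, pool_kernel=2)

print(rows_bad, rows_ok)  # 512 128
```

Under these assumptions the leftover 2x2 spatial positions get folded into the first dimension (128 * 4 = 512), which matches the ValueError above.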

ShunLu91 · 2019-12-10T03:04:23Z

Thanks! I will fix these bugs.

lucaskyle · 2019-12-10T03:20:33Z

This is strange: the train loss is still stuck at Loss: 2.303.
It does not converge.

With the other models I tried, the train loss drops easily.

Maybe something is wrong in the supernetwork.
I will check the Megvii code and their supermodel.
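
Side note on the 2.303 value: it is what cross-entropy gives when a 10-class model predicts every class with equal probability, since -ln(1/10) is about 2.303. A minimal check in plain Python (the helper name and the target index are just illustrative):

```python
import math

NUM_CLASSES = 10  # CIFAR-10

def cross_entropy(probs, target):
    """Cross-entropy for one sample, given predicted class probabilities."""
    return -math.log(probs[target])

# A model that has learned nothing predicts a uniform distribution.
uniform = [1.0 / NUM_CLASSES] * NUM_CLASSES
chance_loss = cross_entropy(uniform, target=3)  # any target gives the same value

print(f"{chance_loss:.3f}")  # 2.303
```

So a loss that stays at 2.303 suggests the network is still outputting chance-level predictions, i.e. it is not learning at all rather than learning slowly.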

ShunLu91 · 2019-12-12T06:46:20Z

Sorry, there were some bugs in model.py, and they have been fixed now. You can refer to the latest version of the code.

lucaskyle · 2019-12-12T06:50:44Z

It was the learning rate.
Seems like you fixed it.

Thanks.

ShunLu91 closed this as completed Dec 12, 2019
