Training Time issue when training Condensenet-light on cifar100 #21

Closed
infrontofme opened this issue Jan 15, 2019 · 3 comments

Comments

@infrontofme

Hi,
I am reproducing your work in TensorFlow, but I found that the dropping operation takes a lot of time during training. Have you encountered this problem? What do you think might be the reason?

@ShichenLiu
Owner

Hi,

In my implementation, I multiply the weight tensor with a binary mask, which should not significantly affect the training speed. Could you describe how you implemented the dropping operator? Thanks!
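
For reference, a minimal sketch of what such a masked 1x1 convolution might look like in PyTorch (the class name, shapes, and mask handling below are illustrative assumptions, not the exact code in this repository):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MaskedConv2d(nn.Module):
    """1x1 convolution whose weight is multiplied by a binary mask."""
    def __init__(self, in_channels, out_channels):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(out_channels, in_channels, 1, 1))
        # Non-trainable binary mask; zero entries are the dropped connections.
        self.register_buffer("mask", torch.ones(out_channels, in_channels, 1, 1))

    def forward(self, x):
        # The element-wise multiply is a single fused tensor op,
        # so it adds negligible overhead per forward pass.
        return F.conv2d(x, self.weight * self.mask)

layer = MaskedConv2d(32, 64)
layer.mask[:, ::2] = 0                       # drop half of the input connections
out = layer(torch.randn(8, 32, 16, 16))      # shape: (8, 64, 16, 16)
```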

@infrontofme
Author

Thank you for your reply.

I also multiply the weight tensor with a binary mask.
I found that TensorFlow and PyTorch build their computation graphs differently. TensorFlow's graph is static, so when I performed the masking inside a Python loop, the graph kept growing and each step became slower. After I fixed my code, the problem was solved.
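
To illustrate the pitfall (a hypothetical TensorFlow 1.x-style sketch, not the actual code from either implementation): the masking op should be built once, outside the training loop, so the static graph stays a fixed size.

```python
import tensorflow as tf  # TensorFlow 1.x graph mode

graph = tf.Graph()
with graph.as_default():
    weight = tf.get_variable("weight", shape=[3, 3, 32, 64])
    mask = tf.get_variable("mask", shape=[3, 3, 32, 64],
                           initializer=tf.ones_initializer(),
                           trainable=False)
    # Build the masking op ONCE; only run it inside the loop.
    masked_weight = weight * mask
    init = tf.global_variables_initializer()

with tf.Session(graph=graph) as sess:
    sess.run(init)
    for step in range(100):
        # Running existing ops keeps the graph a constant size.
        # Creating new ops here (e.g. calling `weight * mask` each step)
        # would keep adding nodes and make every step slower.
        sess.run(masked_weight)
```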

Your work is awesome!

@ShichenLiu
Owner

Yes, avoid Python for loops over tensors during training in any deep learning framework. They cause CUDA kernels to be launched and torn down frequently, which slows things down considerably. Glad to hear that the problem has been fixed!
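
For example (a hypothetical sketch of the general point, not code from this repository), masking channel by channel in a Python loop launches one small kernel per iteration, while a single vectorized multiply launches one kernel for the whole tensor:

```python
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"
weight = torch.randn(64, 32, 1, 1, device=device)
mask = (torch.rand_like(weight) > 0.5).float()

# Slow: one small kernel launch (plus Python overhead) per output channel.
masked_slow = torch.empty_like(weight)
for c in range(weight.shape[0]):
    masked_slow[c] = weight[c] * mask[c]

# Fast: a single vectorized kernel over the whole tensor.
masked_fast = weight * mask

assert torch.allclose(masked_slow, masked_fast)
```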
