Performance on Resnet101 network #32

twtygqyy · 2018-04-06T14:39:27Z

Hi, I've implemented the resnet101 structure on the top of vgg16 network, while the mAP on VOC datasets could only reach 0.62 after 20 epochs.
Do you have any idea what the problem would be? You can find the code here. Thank you.

chenyuntc · 2018-04-06T14:57:07Z

I think maybe resnet101 is difficult to train.

This maybe helpful.

For Resnets, we fix the first block (total 4) when fine-tuning the network, and only use crop_and_resize to resize the RoIs (7x7) without max-pool (which Xinlei finds useless especially for COCO). The final feature maps are average-pooled for classification and regression. All batch normalization parameters are fixed. Learning rate for biases is not doubled.

twtygqyy · 2018-04-06T15:05:34Z

@chenyuntc Thanks, I also fixed the weights for top layers, but the result didn't improve. As you mentioned, it might be the reason of BN and biases. I'll have another try.

twtygqyy · 2018-04-09T18:20:04Z

Hi @chenyuntc, I've trained the model with:

Fix the first block.
Learning rate for biases is not doubled.
All batch normalization parameters are fixed.
Use 1e-4 as weight decay.

And I restrictedly followed the way of training as I did in caffe, while it seems the performance cannot be improved.
Have you tried to train the model on networks other than VGG16?

chenyuntc · 2018-04-14T11:03:22Z

Actually, I only tried VGG16.

blateyang · 2019-01-26T11:48:44Z

I recently also want to implement resnet structure based on this project. And I found your @twtygqyy codes are very helpful to me. But I have a question about batch normalization. Why we need to fix batch normalization parameters here?

stickOverCarrot · 2020-10-17T11:47:54Z

@blateyang BN only work when batch_size>1 and only work well when batch_size>=16.You can see this paper https://arxiv.org/abs/2002.05712.However，@chenyuntc code only surport batch_size==1

blateyang · 2020-10-19T11:52:55Z

Thanks for your reply! stickOverCarrot <notifications@github.com> 于2020年10月17日周六下午7:48写道：

…

@blateyang <https://github.com/blateyang> BN only work when batch_size>1 and only work well when batch_size>=16.You can see this paper ***@***.*** code only surport batch_size==1 — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#32 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AGHYD4MAIDOBBVEXAMFTZETSLF77NANCNFSM4EZIVDVA> .

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance on Resnet101 network #32

Performance on Resnet101 network #32

twtygqyy commented Apr 6, 2018

chenyuntc commented Apr 6, 2018 •

edited

twtygqyy commented Apr 6, 2018

twtygqyy commented Apr 9, 2018 •

edited

chenyuntc commented Apr 14, 2018

blateyang commented Jan 26, 2019

stickOverCarrot commented Oct 17, 2020

blateyang commented Oct 19, 2020 via email

Performance on Resnet101 network #32

Performance on Resnet101 network #32

Comments

twtygqyy commented Apr 6, 2018

chenyuntc commented Apr 6, 2018 • edited

twtygqyy commented Apr 6, 2018

twtygqyy commented Apr 9, 2018 • edited

chenyuntc commented Apr 14, 2018

blateyang commented Jan 26, 2019

stickOverCarrot commented Oct 17, 2020

blateyang commented Oct 19, 2020 via email

chenyuntc commented Apr 6, 2018 •

edited

twtygqyy commented Apr 9, 2018 •

edited