Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

about pvanet training #43

Closed
blankWorld opened this issue Sep 15, 2017 · 16 comments
Closed

about pvanet training #43

blankWorld opened this issue Sep 15, 2017 · 16 comments

Comments

@blankWorld
Copy link

hi, have you tried pvanet as basenetwork? I tried pvanet using caffe but encountered overfitting problem.
my training sets is 950 images from icdar 2015 trainningsets( the other 50 images as validation sets) and 229 images from icdar 2013.
model is trained by online data augmentation which includes scaling and rotations between ±30°. iou loss overfits a lot that when trainning iou descend to 0.25 validation iou loss still stays high at 0.7. I think I have confirmed everything so much that I can not solve this problem. please help me, Mr. Argman!!!!!!. I have cost two month on this problem.... 555555

@argman
Copy link
Owner

argman commented Sep 18, 2017

I have tried pva, it can get comparable result with resnet(need pretrain on imagenet).
IOU is just part of the loss, you should also see the regression part, during training you should set a balance parameter between the two losses. I do not recommend rotation augmentation as it may cause some un-agreement of text orientation and the start point of text polygon.

@blankWorld
Copy link
Author

thanks! so you just use random scaling to augment data as training with res_net-50?
In my training I use Iou loss and cosine loss to regress rbox and class balancing softmax loss to predict the mask of shrinked text polygon, what is the regression part? whether you mean that in pva training you used other rbox regression loss besides Iou and cosine loss?

@argman
Copy link
Owner

argman commented Sep 18, 2017

Do you use imagenet pretrained models?

@blankWorld
Copy link
Author

yes I do

@argman
Copy link
Owner

argman commented Sep 18, 2017

do you have a code repo? maybe some bugs..

@argman
Copy link
Owner

argman commented Sep 18, 2017

you can check this code first, its easy to run

@blankWorld
Copy link
Author

OK I go to check.. 5555 so sad

@blankWorld
Copy link
Author

sorry to bother you again, will dice loss influence much in pva training? I implemented entirely base upon paper's method

@argman
Copy link
Owner

argman commented Sep 18, 2017

In my experiment, dice loss perform much better..

@zxDeepDiver
Copy link

@argma Hi,what do you mean by

some un-agreement of text orientation and the start point of text polygon.

As far as I know, the rotation augmentation is used in many recent papers on text detection. Is there any difference between EAST and the other algorithms on dealing with the rotation augmentation?

@argman
Copy link
Owner

argman commented Sep 18, 2017

@zxDeepDiver , in east, there is upper, right, down and left side of a polygon, when rotated, this order can change, i think other methods encounter this problem too.

@blankWorld
Copy link
Author

hi! argman, now my iou average training loss over one mini-batch can reduce to 0.18( the loss I used is 1.0 - Iou, 0,18 means that training Iou is 1 - 0.18) and my validation average Iou loss over all test sets which has ignored 'do not care' region reduces to 0.44.
could you share your training and validation loss in the last few iterations?

@argman
Copy link
Owner

argman commented Sep 21, 2017

I do not test that, why do you test the final output ?

@argman argman closed this as completed Dec 6, 2017
@rmmal
Copy link

rmmal commented Sep 23, 2018

@argman do you have a tensorflow slim code for pvanet ?

@bharatsubedi
Copy link

do you have a tensorflow slim implementation code for pvanet?

@hvags
Copy link

hvags commented Sep 25, 2020

I have tried pva, it can get comparable result with resnet(need pretrain on imagenet).

Could you please advice on how you did the pre training? More specifically:

  • How to isolate the backbone and train only this
  • Any suggested subset of Imagenet for training

Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants