Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to increase model capacity for training on a larger dataset? #53

Closed
daniyalDE opened this issue Aug 11, 2020 · 9 comments
Closed

How to increase model capacity for training on a larger dataset? #53

daniyalDE opened this issue Aug 11, 2020 · 9 comments

Comments

@daniyalDE
Copy link

daniyalDE commented Aug 11, 2020

First of all thanks for the amazing work on U-2-net. Now i am trying to train the model from scratch on my own dataset of 60k images which is larger than your dataset. I would like to know how i can increase the model capacity to be able to train on such a dataset.

I have considered replacing the standard rebnconv blocks with residuals as suggested in another issue. What other options i could try? I understand that i need to make the architecture deeper, does this mean that i should make RSU-8 or RSU-9 blocks by adding more convolution layers?

@xuebinqin
Copy link
Owner

xuebinqin commented Aug 12, 2020 via email

@shgidi
Copy link

shgidi commented Aug 12, 2020

@daniyalDE Hi Daniel, I'm interested in similar tasks as well.
Why do you assume that the original model doesn't have the capacity of such a task? How do you determine that the model was "maxed out" on the 10K dataset it was trained on?

@daniyalDE
Copy link
Author

@Nathanua thanks for the feedback. One last thing regarding (4) the input resolution, from what i understand the training dataloader always rescales the input images to 320x320, so if i want to train with higher resolution images should i change the rescale ratio to a higher value?

@daniyalDE
Copy link
Author

@shgidi i have tried training on my dataset and the loss/accuracy stalls after a while which might be that the current model is not complex enough to learn the features of my data which is very different from the datasets that they trained on originally.

@xuebinqin
Copy link
Owner

xuebinqin commented Aug 13, 2020 via email

@EricLe-dev
Copy link

Thanks for your interest. You can try following ideas: (1) increase the filter numbers of each layer or add more layers in the basic bn_relu_conv module, (2) remove some of the dense supervision, (3) try to build RSU-8 or RSU-9, (4) input resolution also matters, etc.

On Tue, Aug 11, 2020 at 6:32 AM Daniyal Arshad @.***> wrote: First of all thanks for the amazing work on U-2-net. Now i am trying to train the model from scratch on my own dataset which is 60k images which is larger than your dataset. I would like to know how i can increase the model capacity to be able to train on such a dataset. I have considered replacing the standard rebnconv blocks with residuals as suggested in another issue. What other options i could try? I understand that i need to make the architecture deeper, does this mean that i should make RSU-8 or RSU-9 blocks by adding more convolution layers? — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#53>, or unsubscribe https://github.com/notifications/unsubscribe-auth/ADSGORKJ5UXPQJVYO5AW2RTSAE26NANCNFSM4P27TBGA .
-- Xuebin Qin PhD Department of Computing Science University of Alberta, Edmonton, AB, Canada Homepage:https://webdocs.cs.ualberta.ca/~xuebin/

Can you please tell me how to disable the side output? I tried disabling them by commenting them out but it did not work. Thank you so much.

@xuebinqin
Copy link
Owner

xuebinqin commented Sep 8, 2020 via email

@EricLe-dev
Copy link

EricLe-dev commented Sep 8, 2020

Thank you so much for your reply. I have a very quick question since I am a big fan of your previous work - BASNet. Does this shares any similarity with this (line 47 - 53 in basnet_train.py)?

As I also shall this kind of behavior with BASNet.
Your quick response is appreciated.

@xuebinqin
Copy link
Owner

xuebinqin commented Sep 8, 2020 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants