Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

smrl_cfvqa_rubi is TOO slow to train #12

Closed
Mike4Ellis opened this issue Dec 15, 2021 · 4 comments
Closed

smrl_cfvqa_rubi is TOO slow to train #12

Mike4Ellis opened this issue Dec 15, 2021 · 4 comments

Comments

@Mike4Ellis
Copy link

I can train all the other versions except smrl_cfvqa_rubi with batch_size = 256, I have to change it to 64 preventing CUDA out of memory. But the training is too slow, which takes a day to train a epoch with three 3090.

I wonder what is the difference between smrl_cfvqa_rubi and other versions making it so speical, and is it normal to train so slowly? or is it because I did something wrong?

@yuleiniu
Copy link
Owner

It is wired. Could you also try smrl_cfvqasimple_rubi as it has the same function as smrl_cfvqa_rubi?

BTW, I will delete smrl_cfvqa_rubi to avoid confusion as it is redundant.

@Mike4Ellis
Copy link
Author

It is wired. Could you also try smrl_cfvqasimple_rubi as it has the same function as smrl_cfvqa_rubi?

BTW, I will delete smrl_cfvqa_rubi to avoid confusion as it is redundant.

I have trained smrl_cfvqasimple_rubi, and it works well.

I realized the only difference between smrl_cfvqasimple_rubi and smrl_cfvqa_rubi is q_single = True in smrl_cfvqa_rubi while False in smrl_cfvqasimple_rubi.
Also, smrl_cfvqa_rubi has mlp_v, but may not work because fusion_mode is rubi.

Is it because q_single? What dose this parameter really mean? And why do we need a single text encoder?

@yuleiniu
Copy link
Owner

The q_single option uses a separate text encoder for the QA branch. In our implementation, the main branch and QA branch share the same text encoder, following RUBi, \ie, q_single=False. For mlp_v, you are right, this option does not work when we implement the 'cfvqasimple' option. You can use smrl_cfvqasimple_rubi for applying RUBi+CF-VQA.

@Mike4Ellis
Copy link
Author

Okay! I got it.

Thank you very much for your patient reply!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants