Training Time Estimation #112

Open
ziqipang opened this issue Dec 31, 2023 · 3 comments

Comments

@ziqipang

Hi,

Thank you for the excellent work! I plan to use your research as the foundation for my agenda. However, I have limited computational resources, so I would like to ask about the estimated time to train the models.

  • I am training on a 4-GPU machine with batch size 16, and each epoch takes ~3 hours. What speed do you see on your side? If mine is too slow, I may need to spend time debugging the server.
  • For the sake of quick iteration, have you tried using less data or fewer epochs during the development stage? If so, could you please share some insights into your experiment settings and how much performance degrades with less training?

Thank you again for your kind help!

@MaureenZOU
Collaborator

Your training speed looks normal. One trick I developed for the next project is to precompute the language encoder weights: https://github.com/UX-Decoder/FIND/blob/708ddf53ab594fe6be642bae2ff54eb42cdb8a9a/configs/grin/focalt_unicl_lang.yaml#L58. If you are interested in this, I can share more details.
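For readers following along, here is a minimal sketch of one plausible reading of "precompute": if the language encoder is frozen and the set of text prompts is fixed, each prompt can be encoded once and looked up afterwards instead of being re-encoded on every training step. This is not the FIND implementation. `toy_text_encoder`, `precompute_text_embeddings`, `CountingEncoder`, and the class names are all hypothetical stand-ins chosen so the example runs with the standard library only.

```python
import hashlib
from typing import Callable, Dict, List, Sequence

Embedding = List[float]


def toy_text_encoder(text: str, dim: int = 8) -> Embedding:
    """Hypothetical stand-in for a frozen language encoder (not the repo's model)."""
    digest = hashlib.sha256(text.encode("utf-8")).digest()
    return [b / 255.0 for b in digest[:dim]]


def precompute_text_embeddings(
    prompts: Sequence[str], encoder: Callable[[str], Embedding]
) -> Dict[str, Embedding]:
    """Run the encoder once per unique prompt and cache the results."""
    unique_prompts = dict.fromkeys(prompts)  # de-duplicate, keep order
    return {prompt: encoder(prompt) for prompt in unique_prompts}


class CountingEncoder:
    """Wraps an encoder and counts how often it is actually called."""

    def __init__(self, fn: Callable[[str], Embedding]) -> None:
        self.fn = fn
        self.calls = 0

    def __call__(self, text: str) -> Embedding:
        self.calls += 1
        return self.fn(text)


# Assumed fixed vocabulary; the real prompt set depends on the dataset.
class_names = ["person", "dog", "car", "dog"]
encoder = CountingEncoder(toy_text_encoder)
cache = precompute_text_embeddings(class_names, encoder)


def training_step(batch_labels: Sequence[str]) -> List[Embedding]:
    """Look up cached embeddings instead of re-running the encoder."""
    return [cache[label] for label in batch_labels]


for _ in range(1000):
    training_step(["dog", "person"])

print(encoder.calls)  # the encoder ran once per unique prompt only
```

The cache is only valid while the encoder stays frozen and the prompts do not change; if the language encoder is fine-tuned, the embeddings would have to be recomputed.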

@ziqipang
Author

@MaureenZOU Thank you so much for the prompt reply! It is good to know that my current speed is normal, but training a model still takes around 6 days (150 hours), which is quite long. I will let you know if I find any ways to accelerate this process in the future.

I am quite interested in the technique you mentioned. I would be really grateful if you could share your trick for precomputing the language encoder weights. Thank you!
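As a quick check on the numbers in this thread, the arithmetic can be sketched as follows. The 3 hours per epoch and 150 hours total come from the comments above; the implied epoch count assumes the per-epoch time stays constant. `estimate_hours` and the reduced setting at the end are hypothetical and assume epoch time scales linearly with the fraction of data used.

```python
HOURS_PER_EPOCH = 3.0  # reported above: ~3 hours per epoch on 4 GPUs
TOTAL_HOURS = 150.0    # reported above: ~150 hours for a full run

# Epoch count implied by the two reported numbers.
implied_epochs = TOTAL_HOURS / HOURS_PER_EPOCH

# Wall-clock days for the full run.
total_days = TOTAL_HOURS / 24.0


def estimate_hours(epochs: float, hours_per_epoch: float, data_fraction: float = 1.0) -> float:
    """Rough estimate, assuming epoch time scales linearly with data size."""
    return epochs * hours_per_epoch * data_fraction


# Hypothetical development setting: 10 epochs on 25% of the data.
dev_hours = estimate_hours(epochs=10, hours_per_epoch=HOURS_PER_EPOCH, data_fraction=0.25)

print(implied_epochs, total_days, dev_hours)
```

Under these assumptions the full schedule is about 50 epochs (6.25 days), while the hypothetical reduced setting would take 7.5 hours. The estimate does not predict how much accuracy is lost with the shorter schedule.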

@Jeenedo2023

Hello, what kind of GPU are you using?


3 participants