You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you for the excellent work, and I plan to use your research as the foundation for my agenda! However, I have limited computation resources, so I would like to ask about the estimated time to train the models.
I am using a 4GPU machine to run with batch size 16, and each epoch takes ~3 hours. I am curious: what is the speed on your side? If my speed is too slow, I might spend time debugging with the server.
For the sake of quick iteration, I am curious if you have tried to use less data or epochs during the development stage. If so, could you please share some insights of your experiment settings and the degradation of performance when training less?
Thank you again for your kind help!
The text was updated successfully, but these errors were encountered:
@MaureenZOU Thank you so much for the prompt reply! It is good to know that my current speed is normal, but it is also quite long to train a model using around 6 days (150 hours). I will share with you if I find any ways to accelerate this process in the future.
I am quite interested in the techniques you mentioned. I will be really grateful if you could share with me your tricks of pre-compute the language encoder weights. Thank you!
Hi,
Thank you for the excellent work, and I plan to use your research as the foundation for my agenda! However, I have limited computation resources, so I would like to ask about the estimated time to train the models.
Thank you again for your kind help!
The text was updated successfully, but these errors were encountered: