Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The time cost for training this instructor? #14

Closed
afalf opened this issue Feb 15, 2023 · 6 comments
Closed

The time cost for training this instructor? #14

afalf opened this issue Feb 15, 2023 · 6 comments

Comments

@afalf
Copy link

afalf commented Feb 15, 2023

Hi, thanks for your great work. I do not see the time cost for training this model on the paper. Could you public the time cost for training the different size models(base, large, xl)?

@Harry-hash
Copy link
Contributor

Hi, thanks a lot for your interest in INSTRUCTOR!

It takes around 6, 12 and 30 hours to train base, large and xl-sized models respectively.

Feel free to leave any further question or comment here!

@afalf
Copy link
Author

afalf commented Feb 16, 2023

Hi, thanks a lot for your interest in INSTRUCTOR!

It takes around 6, 12 and 30 hours to train base, large and xl-sized models respectively.

Feel free to leave any further question or comment here!

Thanks a lot for your reply. But I tried to run the train.py following README, and find that it needs about 300 hours to train the gtr-base model under 4 A100 GPUs.... What should I do to check it?
image

@Harry-hash
Copy link
Contributor

Hi, thanks a lot for your question!

We have prepared much more training data here. It may not be necessary to complete all the training steps. In our experiments, we stopped tuning INSTRUCTOR-base at around 40K steps.

Hope this helps! Feel free to leave any further question or comment here!

@afalf
Copy link
Author

afalf commented Feb 16, 2023

Hi, thanks a lot for your question!

We have prepared much more training data here. It may not be necessary to complete all the training steps. In our experiments, we stopped tuning INSTRUCTOR-base at around 40K steps.

Hope this helps! Feel free to leave any further question or comment here!

Okay, thanks a lot!

@Harry-hash
Copy link
Contributor

Feel free to reopen this issue and add any following comments!

@yangjianxin1
Copy link

what is the batch size to train the instructor? Is 32 enough?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants