Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

can I train in one A100 80G GPU? #51

Closed
hkuit opened this issue Sep 7, 2023 · 3 comments
Closed

can I train in one A100 80G GPU? #51

hkuit opened this issue Sep 7, 2023 · 3 comments

Comments

@hkuit
Copy link

hkuit commented Sep 7, 2023

Hello, thanks for the great work.

Can I train the model using only one A100 80G GPU? Or how can we modify the code so that it can be trained on one gpu? Thank you so much.

@mmaaz60
Copy link
Member

mmaaz60 commented Sep 8, 2023

Hi @hkuit,

Yes, you can train on a single A100-80G GPU. Please make sure to keep the overall batch size to 32. This can be achieved by using the following setting,

--per_device_train_batch_size 8 \
--gradient_accumulation_steps 4 \

Please let me know if it works. Thanks

@hkuit
Copy link
Author

hkuit commented Sep 10, 2023

Thank you for the reply @mmaaz60 , let me try it and update to you later.

@hkuit
Copy link
Author

hkuit commented Sep 11, 2023

Thanks, it works.

@hkuit hkuit closed this as completed Sep 11, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants