-
-
Notifications
You must be signed in to change notification settings - Fork 869
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
LISA can not run on multi-GPU setting #1474
Comments
Can you try setting |
Add |
are you using FSDP or deepspeed? |
It seems this might be some DDP specific issue. I've tried a few things like setting a deterministic seed for the random layer picker and adding |
I'm not sure which one I am using. This is my first time using your LLM framework, and I've only added 'ddp_find_unused_parameters: true' to the 'lisa.yaml' file without making any other changes. |
btw, here's a discussion on deepspeed issues w/ LISA: OptimalScale/LMFlow#726 and a potential workaround: OptimalScale/LMFlow#726 (comment) |
Please check that this issue hasn't been reported before.
Expected Behavior
The LISA should run on multi-GPU.
Current behaviour
The LISA can only run on single-GPU. Change to the multi-GPU will lead to below bug.
Steps to reproduce
Below is the multi-GPU config.
Config yaml
Possible solution
No response
Which Operating Systems are you using?
Python Version
3.9
axolotl branch-commit
main
Acknowledgements
The text was updated successfully, but these errors were encountered: