-
Notifications
You must be signed in to change notification settings - Fork 4.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fsdp_qlora fail #3907
Labels
solved
This problem has been already solved
Comments
try disabling evaluation after training? |
how do I do that? |
remove the eval args in yaml config |
same thing happened this time during training..
|
hiyouga
added
solved
This problem has been already solved
and removed
pending
This problem is yet to be addressed
labels
May 28, 2024
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Reminder
Reproduction
After reinstalling LLaMA-Factory with the latest commits without changing anything, I ran the above script. Which does sft to llama3-8b. It didn't work. One of the processes seemed to shut down during validation:
Expected behavior
do training
System Info
GPUs: 2*RTX 3090
Others
No response
The text was updated successfully, but these errors were encountered: