New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
does opendelta support gradient_checkpointing? #39
Comments
opendelta supports bmtrain which utilizes gradient checkpointing. So which framework of gradient checkpointing do you use? Can you share a minimal reproduction code? |
the codes are:
and the exception is:
|
The problem seems to be that the
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Thank you for the awesome work.
I met some problems when using opendelta with gradient_checkpointing, it just throws:
"RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn"
btw code works well as gradient_checkpointing is closed.
so does opendelta support gradient_checkpointing?
The text was updated successfully, but these errors were encountered: