Strange interaction for DeepZero via text-generation-webui? #59
Comments
I’m not quite sure. I have seen this error before, but I don't know what causes it. Did you build from main, or did you install the PyPI package?
This is from main, git revision 8788fe1. I pointed pip at the GitHub repository.
I think I understand better now. DeepSpeed is trying to copy data that does not exist. This happens because we start by initializing the model with empty weights (see lines 337 to 338 in 8788fe1).
In general, I am not sure AutoAWQ is compatible with DeepSpeed at all, since we rely on accelerate.
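To illustrate what "empty weights" means here, below is a minimal sketch in plain PyTorch. It is not the AutoAWQ or DeepSpeed code, and the helper name `try_copy_to_cpu` is made up for this example. It assumes the empty weights live on PyTorch's `meta` device, which is where accelerate's `init_empty_weights` places parameters: such tensors carry a shape and dtype but no storage, so any attempt to copy their data elsewhere fails.

```python
import torch

# A layer created on the "meta" device has shapes and dtypes but no actual data.
layer = torch.nn.Linear(4, 4, device="meta")


def try_copy_to_cpu(tensor: torch.Tensor) -> str:
    """Hypothetical helper: attempt to materialize a tensor on the CPU."""
    try:
        tensor.to("cpu")
        return "copied"
    except RuntimeError as exc:  # NotImplementedError is a RuntimeError subclass
        return f"failed: {exc}"


print(layer.weight.is_meta)            # the parameter has no backing storage
print(tuple(layer.weight.shape))       # shape metadata is still available
print(try_copy_to_cpu(layer.weight))   # copying out of a meta tensor fails
```

Whether this is exactly the failure seen in the issue is an assumption; the sketch only shows why a framework that tries to copy or partition weights before they are loaded has nothing to work with.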
That was what I suspected when I saw some comments on DeepSpeed about copying empty weights. Hopefully there is a way to stop DeepSpeed from kicking in for model types that are incompatible.
Closing this since we use accelerate.
I'm trying to integrate AutoAWQ into webui: oobabooga/text-generation-webui#3999
Unfortunately, it causes problems when DeepZero is used, and I can't tell where the problem lies.
Any ideas?