We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
我在 V100 单卡上可以跑得动 glm-10b 英文的推理,但是跑 quickstart 中的任务时把模型改成 glm-10b-ch 就会 OOM
The text was updated successfully, but these errors were encountered:
我们正在尝试复现您的问题,请耐心等待。
Sorry, something went wrong.
可以在加载模型的时候将model=loader.get_model() 改成model = loader.get_model().half() 通过这种方式将模型转化为float16, 在V100里就可以装下了。而且这样不会影响预测结果,因为GLM模型本身训练的时候就是float16。
可以了,谢谢
Successfully merging a pull request may close this issue.
我在 V100 单卡上可以跑得动 glm-10b 英文的推理,但是跑 quickstart 中的任务时把模型改成 glm-10b-ch 就会 OOM
The text was updated successfully, but these errors were encountered: