When will a quantized model be available? #48
Comments
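It will be released in the next couple of days.
You can try bitsandbytes quantization. I wrote a tutorial:
A quantized version of Chat is already available. See the updated README.
@GradientGuru The link to the quantized Chat version seems to be dead and I can't download it. Could you please take a look?
Found it. The 8-bit quantized model can be downloaded from this link: https://huggingface.co/trillionmonster/Baichuan-13B-Chat-8bit/tree/main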
As in the title.
Currently, every startup has to quantize the model on the CPU, which is too slow.
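To illustrate why a pre-quantized checkpoint helps with the startup cost described here, below is a minimal pure-Python sketch of the "quantize once, then reuse" idea. It is not the project's actual quantization code: the absmax int8 scheme, the pickle cache file, and the names `quantize_absmax_int8`, `dequantize`, and `load_or_quantize` are all assumptions made for illustration.

```python
import os
import pickle


def quantize_absmax_int8(weights):
    """Map a list of floats to int8-range integers using absmax scaling."""
    absmax = max((abs(w) for w in weights), default=0.0)
    # Avoid division by zero for an all-zero (or empty) tensor.
    scale = absmax / 127.0 if absmax > 0 else 1.0
    quantized = [max(-127, min(127, round(w / scale))) for w in weights]
    return quantized, scale


def dequantize(quantized, scale):
    """Recover approximate float values from the quantized integers."""
    return [v * scale for v in quantized]


def load_or_quantize(weights, cache_path):
    """Return (quantized, scale, from_cache).

    Hypothetical helper: quantization runs only when no cache file exists,
    so later startups skip the slow step and just read the file.
    """
    if os.path.exists(cache_path):
        with open(cache_path, "rb") as f:
            quantized, scale = pickle.load(f)
        return quantized, scale, True
    quantized, scale = quantize_absmax_int8(weights)
    with open(cache_path, "wb") as f:
        pickle.dump((quantized, scale), f)
    return quantized, scale, False
```

The first call pays the quantization cost and writes the cache; every later call with the same `cache_path` only reads the file. Downloading an already quantized checkpoint, such as the 8-bit one linked in this thread, plays the same role as the cache file in this sketch.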