Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update AWQ GPTQ quantization guide #306

Open
wants to merge 2 commits into
base: main
Choose a base branch
from
Open

Update AWQ GPTQ quantization guide #306

wants to merge 2 commits into from

Conversation

miaojinc
Copy link

update content related to Convert Python API, add quantization options.

@Duyi-Wang Duyi-Wang added the documentation Improvements or additions to documentation label Apr 26, 2024
@pujiang2018
Copy link
Contributor

My concern is that the packages in requirements.txt may trigger some issues during security checking, let's target for next version.

@miaojinc
Copy link
Author

miaojinc commented Jun 4, 2024

My concern is that the packages in requirements.txt may trigger some issues during security checking, let's target for next version.

Yes, it has some potential issues. The reason of the packages version is because we lacks the group quantization operators, so we have to quantize the models on CPU according to our kernel. If we can align with autoawq and autogptq, we can load the quantized weights directly with out any modification for autoawq and autogptq.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants