[Feature]Merge lmdeploy lite calibrate and lmdeploy lite auto_awq #849

Merged: 3 commits, Jan 9, 2024

@pppppM (Collaborator) commented Dec 15, 2023

Before this PR, AWQ quantization required running two commands:

lmdeploy lite calibrate $HF_MODEL
lmdeploy lite auto_awq $HF_MODEL

With this PR, AWQ quantization needs only one command:

lmdeploy lite auto_awq $HF_MODEL
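
The two workflows can be compared side by side with a small dry-run sketch. It only prints the commands quoted above and does not run them. The helper name `awq_commands`, its `legacy`/`merged` arguments, and the placeholder model path are hypothetical and exist only for illustration; they are not part of lmdeploy.

```shell
#!/bin/sh
# Dry-run sketch: prints the commands for each workflow instead of running them.
# awq_commands is a hypothetical helper, not part of lmdeploy.
awq_commands() {
    workflow=$1   # "legacy" (two steps) or "merged" (one step)
    model=$2      # path or name of the Hugging Face model
    case "$workflow" in
        legacy)
            # Pre-PR usage: calibrate first, then quantize.
            printf 'lmdeploy lite calibrate %s\n' "$model"
            printf 'lmdeploy lite auto_awq %s\n' "$model"
            ;;
        merged)
            # Usage introduced by this PR: a single command.
            printf 'lmdeploy lite auto_awq %s\n' "$model"
            ;;
        *)
            printf 'unknown workflow: %s\n' "$workflow" >&2
            return 1
            ;;
    esac
}

# /path/to/hf_model is a placeholder used when HF_MODEL is unset.
awq_commands legacy "${HF_MODEL:-/path/to/hf_model}"
awq_commands merged "${HF_MODEL:-/path/to/hf_model}"
```

Piping the output to `sh` would execute the printed commands, assuming lmdeploy is installed and the model path is valid.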

@lvhan028 (Collaborator)

Will it affect kv8 quantization?

@lvhan028 added the enhancement label Dec 18, 2023
docs/en/w4a16.md Outdated

```shell
lmdeploy lite calibrate \
```
Collaborator

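Changing the API or usage arbitrarily will draw complaints from users. If the change is really necessary, keep the old usage and add a deprecation timeline.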

Collaborator Author

This change does not affect the original usage. calibrate is still kept, and the original lmdeploy calibrate + lmdeploy auto_awq workflow can still be used.

@pppppM (Collaborator, Author) commented Jan 8, 2024

> Will it affect kv8 quantization?

It will not affect KV8 usage or the original W4A16 workflow.
The new W4A16 usage is simpler than the old one.

@RunningLeon (Collaborator) left a comment

LGTM

@lvhan028 merged commit 1a76191 into InternLM:main on Jan 9, 2024
3 of 5 checks passed