[Feature]Merge `lmdeploy lite calibrate` and `lmdeploy lite auto_awq` #849

pppppM · 2023-12-15T17:30:35Z

Before this PR, AWQ quantization needs to execute two commands

lmdeploy lite calibrate  $HF_MODEL
lmdeploy lite auto_awq $HF_MODEL

In this PR, AWQ quantization only needs to execute one command

lmdeploy lite auto_awq $HF_MODEL

lvhan028 · 2023-12-18T03:00:55Z

Will it affect kv8 quantization?

lmdeploy/lite/apis/auto_awq.py

lmdeploy/lite/apis/calibrate.py

tpoisonooo · 2023-12-20T02:00:37Z

docs/en/w4a16.md


 ```shell
-lmdeploy lite calibrate \


随意改 API/用法会被怼。实在要改，就保持旧的加 deprecate 时间。

这个更改不会影响原本用法，calibrate 依旧会保留，原本的 lmdeploy calibrate + lmdeploy auto_awq 依旧可以使用

pppppM · 2024-01-08T14:15:56Z

Will it affect kv8 quantization?

It will not affect the usage of KV8 and the original W4A16.
The new usage of W4A16 will be simpler than the old one.

RunningLeon

LGTM

pppppM requested a review from lvhan028 December 15, 2023 17:31

lvhan028 requested review from RunningLeon and tpoisonooo December 18, 2023 02:56

lvhan028 added the enhancement New feature or request label Dec 18, 2023

RunningLeon reviewed Dec 18, 2023

View reviewed changes

lmdeploy/lite/apis/auto_awq.py Show resolved Hide resolved

RunningLeon reviewed Dec 18, 2023

View reviewed changes

lmdeploy/lite/apis/calibrate.py Outdated Show resolved Hide resolved

tpoisonooo reviewed Dec 20, 2023

View reviewed changes

pppppM mentioned this pull request Dec 21, 2023

[Fix] Fix conflicts in lite #878

Merged

merge calibrate and auto_awq

680e3b0

pppppM force-pushed the merge_calibrate branch from d55e896 to 680e3b0 Compare January 8, 2024 13:51

pppppM added 2 commits January 8, 2024 22:09

update cli

a7a8844

update docstring

97e5da1

pppppM requested review from RunningLeon and tpoisonooo January 8, 2024 14:12

tpoisonooo approved these changes Jan 9, 2024

View reviewed changes

RunningLeon approved these changes Jan 9, 2024

View reviewed changes

lvhan028 merged commit 1a76191 into InternLM:main Jan 9, 2024
3 of 5 checks passed

pppppM had a problem deploying to prod February 7, 2024 14:10 — with GitHub Actions Failure

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature]Merge `lmdeploy lite calibrate` and `lmdeploy lite auto_awq` #849

[Feature]Merge `lmdeploy lite calibrate` and `lmdeploy lite auto_awq` #849

pppppM commented Dec 15, 2023

lvhan028 commented Dec 18, 2023

tpoisonooo Dec 20, 2023 •

edited

pppppM Jan 8, 2024

pppppM commented Jan 8, 2024

RunningLeon left a comment

[Feature]Merge lmdeploy lite calibrate and lmdeploy lite auto_awq #849

[Feature]Merge lmdeploy lite calibrate and lmdeploy lite auto_awq #849

Conversation

pppppM commented Dec 15, 2023

lvhan028 commented Dec 18, 2023

tpoisonooo Dec 20, 2023 • edited

Choose a reason for hiding this comment

pppppM Jan 8, 2024

Choose a reason for hiding this comment

pppppM commented Jan 8, 2024

RunningLeon left a comment

Choose a reason for hiding this comment

[Feature]Merge `lmdeploy lite calibrate` and `lmdeploy lite auto_awq` #849

[Feature]Merge `lmdeploy lite calibrate` and `lmdeploy lite auto_awq` #849

tpoisonooo Dec 20, 2023 •

edited