added support for quantize on LLM module #1080

orellavie1212 · 2023-09-18T12:23:46Z

No description provided.

updating LLM to fix the problem engine_args = EngineArgs( TypeError: __init__() got an unexpected keyword argument 'quantization'

updated the docstr of the class

orellavie1212 · 2023-09-18T12:24:49Z

fixing the problem arise via init (even if kwargs mentioned)
engine_args = EngineArgs(
TypeError: init() got an unexpected keyword argument 'quantization'

WoosukKwon · 2023-09-18T17:42:27Z

@orellavie1212 Thanks for the PR.

fixing the problem arise via init (even if kwargs mentioned)
engine_args = EngineArgs(
TypeError: init() got an unexpected keyword argument 'quantization'

Could you explain the problem in more detail? While I'm good with adding the quantization parameter to the LLM class for clarity, I believe it should already work. I've checked that llm = LLM(model="casperhansen/vicuna-7b-v1.5-awq", quantization="awq") just works.

orellavie1212 · 2023-09-18T17:45:52Z

I did exactly as you mentioned, I thought too, on python 3.9 (aws sagemaker) it doesn't work and quantize param didn't work, only after I made the change, it worked.
It could depend on the python version or other configuration, but to make it robust you could merge. The problem is mentioned on the first comment, this is exactly the bug.

WoosukKwon

@orellavie1212 LGTM! I made minor changes on the docstring. Thanks again for submitting the PR.

It's the full list of changes in documentation prepared for the vLLM 1.21 release. --------- Signed-off-by: Artur Fierka <artur.fierka@intel.com> Co-authored-by: Bartosz Kuncer <bartosz.kuncer@intel.com> Co-authored-by: Bartosz Kuncer <bkuncer@habana.ai> Co-authored-by: Mohit Deopujari <mdeopujari@habana.ai> Co-authored-by: Artur Fierka <artur.fierka@intel.com> Co-authored-by: AnetaKaczynska <aneta.kaczynska@intel.com>

orellavie1212 added 2 commits September 18, 2023 15:14

Update llm.py

a4b4e81

updating LLM to fix the problem engine_args = EngineArgs( TypeError: __init__() got an unexpected keyword argument 'quantization'

Update llm.py

51a6764

updated the docstr of the class

WoosukKwon added 2 commits September 18, 2023 17:59

Minor fix

8aa36e9

Minor

50e3c9c

WoosukKwon approved these changes Sep 18, 2023

View reviewed changes

WoosukKwon merged commit fbe66e1 into vllm-project:main Sep 18, 2023

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024

added support for quantize on LLM module (vllm-project#1080)

a44d0f8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

added support for quantize on LLM module #1080

added support for quantize on LLM module #1080

Uh oh!

orellavie1212 commented Sep 18, 2023

Uh oh!

orellavie1212 commented Sep 18, 2023

Uh oh!

WoosukKwon commented Sep 18, 2023

Uh oh!

orellavie1212 commented Sep 18, 2023 •

edited

Loading

Uh oh!

WoosukKwon left a comment

Uh oh!

Uh oh!

Uh oh!

added support for quantize on LLM module #1080

added support for quantize on LLM module #1080

Uh oh!

Conversation

orellavie1212 commented Sep 18, 2023

Uh oh!

orellavie1212 commented Sep 18, 2023

Uh oh!

WoosukKwon commented Sep 18, 2023

Uh oh!

orellavie1212 commented Sep 18, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

WoosukKwon left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

orellavie1212 commented Sep 18, 2023 •

edited

Loading