[Bugfix] fix lora_dtype value type in arg_utils.py #5398

c3-ali · 2024-06-11T00:02:15Z

FILL IN THE PR DESCRIPTION HERE

rkooo567

QQ: does it not support torch.dtype?

c3-ali · 2024-06-11T17:39:01Z

@rkooo567 It does! I followed the argparse option, choices=['auto', 'float16', 'bfloat16', 'float32'], and used str, but LoraConfig declares the field as lora_dtype: Optional[torch.dtype] = None, and the implementation accepts both str and torch.dtype:

    def verify_with_model_config(self, model_config: ModelConfig):
        if self.lora_dtype in (None, "auto"):
            self.lora_dtype = model_config.dtype
        elif isinstance(self.lora_dtype, str):
            self.lora_dtype = getattr(torch, self.lora_dtype)

So lora_dtype: Optional[Union[str, torch.dtype]] = 'auto' is the more precise definition. I'm going to make that change.
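For illustration, here is a minimal sketch of how a string from the CLI and a dtype object can both flow through the same field. It is not the vLLM code: `FakeDType`, `fake_torch`, `LoraConfigSketch`, `verify_with_model_dtype`, and `build_parser` are hypothetical names, and the stand-in dtype class is used only so the snippet runs without PyTorch installed. With real PyTorch, `fake_torch` would be `torch` and `FakeDType` would be `torch.dtype`.

```python
import argparse
from dataclasses import dataclass
from typing import Optional, Union


class FakeDType:
    """Hypothetical stand-in for torch.dtype (assumption, not the real class)."""

    def __init__(self, name: str) -> None:
        self.name = name

    def __repr__(self) -> str:
        return f"fake_torch.{self.name}"


class fake_torch:
    """Hypothetical stand-in for the torch module's dtype attributes."""

    float16 = FakeDType("float16")
    bfloat16 = FakeDType("bfloat16")
    float32 = FakeDType("float32")


@dataclass
class LoraConfigSketch:
    # Accepts None, a string such as 'auto' or 'bfloat16', or a dtype object.
    lora_dtype: Optional[Union[str, FakeDType]] = "auto"

    def verify_with_model_dtype(self, model_dtype: FakeDType) -> None:
        # Mirrors the branch structure quoted above: fall back to the
        # model's dtype, or resolve a string name to a dtype attribute.
        if self.lora_dtype in (None, "auto"):
            self.lora_dtype = model_dtype
        elif isinstance(self.lora_dtype, str):
            self.lora_dtype = getattr(fake_torch, self.lora_dtype)
        # A dtype object passed in directly is left unchanged.


def build_parser() -> argparse.ArgumentParser:
    # The CLI can only ever produce one of these strings.
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--lora-dtype",
        type=str,
        default="auto",
        choices=["auto", "float16", "bfloat16", "float32"],
    )
    return parser
```

A value parsed from the command line arrives as a str and is resolved later, while a caller constructing the config in Python can pass a dtype directly; the Union annotation describes both paths.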

fix lora_dtype value type in arg_utils.py

3fe019d

simon-mo approved these changes Jun 11, 2024


simon-mo enabled auto-merge (squash) June 11, 2024 00:56

rkooo567 approved these changes Jun 11, 2024


zhuohan123 disabled auto-merge June 11, 2024 05:50

zhuohan123 enabled auto-merge (squash) June 11, 2024 05:50

simon-mo disabled auto-merge June 11, 2024 17:40

simon-mo merged commit 00e6a2d into vllm-project:main Jun 11, 2024
100 of 103 checks passed

c3-ali mentioned this pull request Jun 11, 2024

[Bugfix] fix lora_dtype value type in arg_utils.py - part 2 #5428

Open

robertgshaw2-neuralmagic pushed a commit to neuralmagic/nm-vllm that referenced this pull request Jun 12, 2024

[Bugfix] fix lora_dtype value type in arg_utils.py (vllm-project#5398)

cbb4376

joerunde pushed a commit to joerunde/vllm that referenced this pull request Jun 17, 2024

[Bugfix] fix lora_dtype value type in arg_utils.py (vllm-project#5398)

444b779

xjpang pushed a commit to xjpang/vllm that referenced this pull request Jun 27, 2024

[Bugfix] fix lora_dtype value type in arg_utils.py (vllm-project#5398)

79fc9ca

xjpang pushed a commit to xjpang/vllm that referenced this pull request Jul 8, 2024

[Bugfix] fix lora_dtype value type in arg_utils.py (vllm-project#5398)

6bfe969

xjpang pushed a commit to xjpang/vllm that referenced this pull request Jul 24, 2024

[Bugfix] fix lora_dtype value type in arg_utils.py (vllm-project#5398)

ea1963a
