
Conversation

@Erland366
Collaborator

People are complaining that they can't use LoRA with vLLM because the `load_lora` method is not available. This happens because, when loading a LoRA model, `get_peft_model` takes the `Unsloth: Already have LoRA adapters! We shall skip this step.` branch and never applies the patching.

This PR simply applies the patching inside that branch as well. We can't just move the patching to the beginning of the function, because when doing inference while training (e.g., during GRPO training), inference would then run only on the base model.
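In pure-Python terms, the bug and the fix look roughly like this. This is a minimal sketch with hypothetical names (`patch_vllm_lora`, the dict-based model), not Unsloth's actual code; it only illustrates the early-return branch that skipped the patching:

```python
def patch_vllm_lora(model):
    # Hypothetical stand-in for the real patching step:
    # attach a load_lora method so vLLM can load adapters later.
    model["load_lora"] = lambda path: f"loaded {path}"


def get_peft_model(model, has_adapters):
    if has_adapters:
        # "Unsloth: Already have LoRA adapters! We shall skip this step."
        # Before this PR, we returned here WITHOUT patching,
        # so load_lora was missing on loaded-LoRA models.
        patch_vllm_lora(model)  # the fix: patch in this branch too
        return model
    # Fresh model: attach adapters first, then patch.
    model["adapters"] = True
    patch_vllm_lora(model)
    return model


# A model loaded with existing LoRA adapters now gets load_lora as well.
m = get_peft_model({"adapters": True}, has_adapters=True)
assert "load_lora" in m
```

Note the patching is called inside the branch rather than hoisted above the `if`: hoisting it would also patch during GRPO-style inference-while-training, where only the base model should be used.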

This PR still has a flaw: vLLM inference has to be run once before it can do inference on the loaded LoRA. This may be related to this part of unsloth-zoo:

https://github.com/unslothai/unsloth-zoo/blob/a9857088bdaf412bef36800d837a3a37657555c8/unsloth_zoo/vllm_utils.py#L1206-L1212

Related issue -> #1670 (comment)
