initial multi-lora support #1103
Conversation
I'll try to review this one and #1098 later today. Better supporting LoRAs is a priority for me now, and your PRs are very helpful. I'm also considering using a custom version of PEFT in requirements.txt to support applying LoRAs to 4-bit models.
server.py (outdated)
@@ -211,8 +211,9 @@ def create_model_menus():
     ui.create_refresh_button(shared.gradio['model_menu'], lambda: None, lambda: {'choices': get_available_models()}, 'refresh-button')
     with gr.Column():
         with gr.Row():
-            shared.gradio['lora_menu'] = gr.Dropdown(choices=available_loras, value=shared.lora_name, label='LoRA')
+            shared.gradio['lora_menu'] = gr.CheckboxGroup(choices=available_loras, value=shared.lora_names, label='LoRA model(s)')
I don't know how many LoRAs people might end up with, but you could maybe keep this a Dropdown and add the multiselect=True argument. Probably a clearer UI experience?
Oh wow, I'm dumb. In #853 this is what the auto webui used for styles (my suggested option #4), but I never looked into how that was done. This is indeed much cleaner. Going to change that and push, though it seems I'll need to rebase and force-push, as the main branch changed code right next to the server.py edits here.
rebuilt off main (435c600 to 0143b92)
If you select multiple LoRAs (4 or 5, say), the row containing the new button and the LoRA dropdown grows in an awkward way. Is it possible to implement this menu in such a way that it occupies a constant area and never grows?
uhh... by really forcing it with CSS, it prevents vertical growth, at the cost of a different awkwardness: if you have more LoRAs than fit on one line, they're hidden behind a scrollbar. I don't know if that's worth doing? I can add it if you prefer.
if lora_name not in ['None', '']:
    print(f"Adding the LoRA {lora_name} to the model...")
    # Only adding, and already peft? Do it the easy way.
Is it correct to assume that the model is already peft? For instance, if you load llama-7b without the --lora argument, it will not have been loaded with PeftModel.from_pretrained.

Edit: okay, this is only executed if len(set(shared.lora_names)) > 0, in which case the model will have been loaded with PeftModel.from_pretrained.
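To make the branch being discussed concrete, here is a minimal sketch of the dispatch this guard implies: if LoRAs are already applied, the model must be a PeftModel and new adapters can be attached cheaply; otherwise the base model is wrapped first. The stub classes and the function name apply_loras are illustrative stand-ins, not the real transformers/peft objects or the PR's actual code.

```python
class BaseModel:
    """Stand-in for a plain (non-peft) transformers model."""


class PeftWrapped:
    """Stand-in for peft's PeftModel; tracks which adapters were attached."""

    def __init__(self, base, first_lora):
        self.base = base
        self.adapters = [first_lora]

    def load_adapter(self, lora):
        # Cheap path: attach another adapter to an already-wrapped model.
        self.adapters.append(lora)


def apply_loras(model, current_loras, new_loras):
    """Attach any LoRAs in new_loras that aren't already applied."""
    added = [l for l in new_loras if l not in current_loras]
    if not added:
        return model
    if len(current_loras) > 0:
        # Model is already peft-wrapped: "do it the easy way".
        for lora in added:
            model.load_adapter(lora)
        return model
    # Fresh model: wrap it with the first LoRA, then attach the rest.
    model = PeftWrapped(model, added[0])
    for lora in added[1:]:
        model.load_adapter(lora)
    return model
```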
@@ -25,7 +38,11 @@ def add_lora_to_model(lora_name):
    elif shared.args.load_in_8bit:
        params['device_map'] = {'': 0}

-    shared.model = PeftModel.from_pretrained(shared.model, Path(f"{shared.args.lora_dir}/{lora_name}"), **params)
+    shared.model = PeftModel.from_pretrained(shared.model, Path(f"{shared.args.lora_dir}/{lora_names[0]}"), **params)
Also related to the comment above: if the model is "fresh", is it necessary to reload it with PeftModel.from_pretrained?
If I'm not mistaken, this isn't actually a full reload; it just takes the non-peft model and wraps it (applying the first LoRA in the process). It definitely runs a lot faster than a full model load, and at least it doesn't print any of the loading noise to the console.
requests
rwkv==0.7.3
safetensors==0.3.0
sentencepiece
pyyaml
tqdm
git+https://github.com/huggingface/peft
Just to be sure, is the dev version of peft required? The code seems to run without errors with peft==0.2.0.
peft 0.2.0 was released March 9th (https://github.com/huggingface/peft/releases), and multi-adapter support was merged April 6th (huggingface/peft#263), so yes, it's needed. I'm not sure what could lead it to seemingly work on 0.2.0 for you; possibly you accidentally had a different version installed while testing, since you were recently testing johnsmith0031/alpaca_lora_4bit#13 as well? idk.
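Since multi-adapter support postdates the 0.2.0 release, any environment reporting peft 0.2.0 or older cannot have it. A hedged sketch of that check; the function names and the simple dotted-version parsing are mine, not part of the PR:

```python
def parse_version(v: str) -> tuple:
    """Parse a dotted version string like '0.2.0' into a comparable tuple,
    ignoring non-numeric suffixes such as 'dev0'."""
    return tuple(int(part) for part in v.split(".") if part.isdigit())


def peft_supports_multi_adapter(installed: str) -> bool:
    """Multi-adapter support was merged after the 0.2.0 release, so any
    version at or below 0.2.0 cannot include it (hypothetical check)."""
    return parse_version(installed) > parse_version("0.2.0")
```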
It's definitely needed then.
I did some reorganizing and it looks fine now, no need to change the CSS. What I really wanted was for the model and the lora dropdowns to be on the same line.
For #853
Contains initial support for loading multiple LoRAs at once.
Works as a checkbox group, with a refresh button, and an apply button.
If you checkmark new loras that weren't checkmarked before, it loads them very quickly. If you uncheck prior loras, it removes all of them for now and then re-adds the remaining selection. Either way, it's much faster than a full model reload.
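The add/remove behavior described above reduces to a set comparison of the previous and current checkbox selections. A minimal sketch, assuming this shape of the logic (the function name and return format are hypothetical, not the PR's code):

```python
def plan_lora_update(previous, selected):
    """Decide how to apply a changed LoRA selection: additions alone can be
    loaded incrementally, but any removal forces a remove-all-and-re-add
    cycle, as described in the PR (hypothetical helper)."""
    prev, new = set(previous), set(selected)
    added = new - prev
    removed = prev - new
    if removed:
        # Something was unchecked: drop everything, then re-add the selection.
        return {"action": "rebuild", "loras": list(selected)}
    if added:
        # Only additions: fast path, just load the new adapters.
        return {"action": "add", "loras": sorted(added)}
    return {"action": "none", "loras": []}
```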
The merge_and_unload function seems to not support 8-bit.

This alters requirements.txt to require a direct git copy of peft for now, as they haven't published a release with this feature yet.
I have not fully tested the results of generating with multiple LoRAs, only that they load/unload and the model still works.
I have also not tested against the possibility of memleaks or other issues arising from repeatedly mucking with loras on the fly.
I have only tested in 8bit with LLaMA-13B currently.