Synchronize lora's merge, unmerge, etc. modifications to lora's tp_layer. #1919

Open
wants to merge 3 commits into base: main

Conversation

zhangsheng377
Contributor

A previous commit moved the merge and other such functions from LoraLayer into Linear and the other layer subclasses, but LoraParallelLinear in tp_layer was missed.
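
For readers unfamiliar with these methods, here is a conceptual sketch of what merge/unmerge do (illustrative only, not PEFT's actual implementation; the tensor names, shapes, and signatures are assumptions):

```python
# Conceptual sketch, not PEFT code: merge() folds the low-rank update
# delta_W = scaling * (B @ A) into the frozen base weight so inference needs
# no extra matmuls; unmerge() subtracts it again to restore the base weight.
import torch


def merge(base_weight: torch.Tensor, lora_A: torch.Tensor, lora_B: torch.Tensor, scaling: float) -> torch.Tensor:
    # base_weight: (out, in), lora_A: (r, in), lora_B: (out, r)
    return base_weight + scaling * (lora_B @ lora_A)


def unmerge(merged_weight: torch.Tensor, lora_A: torch.Tensor, lora_B: torch.Tensor, scaling: float) -> torch.Tensor:
    return merged_weight - scaling * (lora_B @ lora_A)
```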



Synchronize lora's merge, unmerge, etc. modifications to lora's tp_layer.
@zhangsheng377
Contributor Author

@BenjaminBossan

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@BenjaminBossan
Member

Thanks for this update. Could you please run make style?

@zhangsheng377
Contributor Author

> Thanks for this update. Could you please run make style?

Ok. Sorry, I forgot it.

@BenjaminBossan
Member


The PR is failing on Python 3.8 because of the type annotation syntax. Adding `from __future__ import annotations` should fix that. I assume you checked that the changes you made to the megatron layer work?
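
A minimal illustration of why the future import is needed on Python 3.8 (the signature below is hypothetical, not the PR's actual code):

```python
# Hypothetical example, not the PR's code: PEP 604 unions (`X | None`) and
# builtin generics (`list[str]`) in annotations are evaluated eagerly on
# Python 3.8 and raise TypeError at import time. With the future import,
# annotations are stored as strings (PEP 563), so 3.8 accepts them.
from __future__ import annotations


def merge(safe_merge: bool = False, adapter_names: list[str] | None = None) -> None:
    ...
```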

```diff
@@ -108,7 +116,7 @@ def update_layer(
         else:
             lora_dropout_layer = nn.Identity()
 
-        self.lora_dropout[adapter_name] = lora_dropout_layer
+        self.lora_dropout.update(nn.ModuleDict({adapter_name: lora_dropout_layer}))
```
@BenjaminBossan
Member


This should not be necessary, right?
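
As a standalone check (illustrative, not PEFT code), plain item assignment on an nn.ModuleDict registers the submodule just like update(), so the two lines in the diff above should behave identically:

```python
# Standalone check, not PEFT code: both ways of adding an entry to an
# nn.ModuleDict register the module as a child of the parent.
import torch.nn as nn

dropouts = nn.ModuleDict()
dropouts["adapter_a"] = nn.Dropout(p=0.1)                         # item assignment
dropouts.update(nn.ModuleDict({"adapter_b": nn.Dropout(p=0.1)}))  # update()

# Both entries appear as registered children.
print([name for name, _ in dropouts.named_children()])  # ['adapter_a', 'adapter_b']
```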

@zhangsheng377
Contributor Author

> The PR is failing on Python 3.8 because of the type annotation syntax. Adding `from __future__ import annotations` should fix that. I assume you checked that the changes you made to the megatron layer work?

The merge function is copied from Linear.merge.

But I forgot to add `from __future__ import annotations`.

@BenjaminBossan
Member

I don't really have experience with megatron, so I'm not sure if these methods would just work when copied 1:1. Just to be sure, did you test with your setup that nothing breaks with these changes? If yes, I think we can merge and in the future try to be more careful to keep tp_layer.py in sync.
