
FIX Make DoRA work with Conv1D layers #1588

Merged

Conversation

BenjaminBossan (Member)

Previously, the code did not transpose the weights when calculating the weight norm.

In addition, update some DoRA tests to use BitsAndBytesConfig.

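As a rough illustration of the fix (a minimal sketch, not the actual PEFT implementation; the get_weight_norm helper and fan_in_fan_out flag are illustrative names): GPT-2's Conv1D stores its weight as (in_features, out_features), i.e. transposed relative to nn.Linear, so the weight has to be brought into the usual layout before the DoRA weight norm is computed.

import torch

def get_weight_norm(weight, lora_delta, scaling, fan_in_fan_out=False):
    # fan_in_fan_out=True corresponds to Conv1D-style weights stored as
    # (in_features, out_features); transpose to the nn.Linear layout
    # (out_features, in_features) before adding the LoRA delta
    if fan_in_fan_out:
        weight = weight.T
    merged = weight + scaling * lora_delta
    # DoRA uses the per-output-channel L2 norm of the merged weight
    return torch.linalg.norm(merged, dim=1)

Without the transpose, the base weight and the LoRA delta have mismatched shapes for Conv1D layers, which is the error reproduced later in this thread.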
BenjaminBossan mentioned this pull request Mar 25, 2024
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

arash2060 (Mar 25, 2024)

Lines 399 and 405 also need to be updated. I needed to transpose the delta_weight in weight_norm = self._get_weight_norm(orig_weights, delta_weight, scaling=1) and transpose dora_factor.view(-1, 1) to match the dimensions.

Similarly for unsafe merge.
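For concreteness, a hypothetical shape illustration of the mismatch described above, using GPT-2's c_attn layer (a Conv1D module whose weight is stored as (in_features, out_features) = (768, 2304), while the LoRA delta comes out in the nn.Linear layout (2304, 768)):

import torch

base_weight = torch.empty(768, 2304)    # Conv1D layout: (in_features, out_features)
delta_weight = torch.empty(2304, 768)   # LoRA delta in nn.Linear layout

# base_weight + delta_weight            # RuntimeError: 768 vs 2304 at dimension 1
merged = base_weight + delta_weight.T   # transposing the delta aligns the shapes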

BenjaminBossan (Member, Author)

Thanks for the feedback. Do you have a code snippet that results in an error without these additional changes?

arash2060 (Mar 25, 2024)

Nothing that I can share, unfortunately. To replicate, it should be enough to initialize a DoRA model on gpt2 and call merge_and_unload() on it.

arash2060

There you go:

import transformers, peft
model = transformers.AutoModelForCausalLM.from_pretrained('gpt2')
dora_config = peft.LoraConfig(r=4, use_dora=True)
model = peft.get_peft_model(model, dora_config)
model.merge_and_unload('/tmp/m/')

arash2060

Unloading and merging model:   5%|▌         | 9/176 [00:00<00:01, 113.93it/s]
Traceback (most recent call last):
  File "...python3.9/site-packages/IPython/core/interactiveshell.py", line 3526, in run_code
    exec(code_obj, self.user_global_ns, self.user_ns)
  File "<ipython-input-4-3672d1da59ca>", line 1, in <module>
    model.merge_and_unload('/tmp/m/')
  File "...python3.9/site-packages/peft/tuners/lora/model.py", line 784, in merge_and_unload
    return self._unload_and_optionally_merge(
  File "...python3.9/site-packages/peft/tuners/lora/model.py", line 438, in _unload_and_optionally_merge
    target.merge(safe_merge=safe_merge, adapter_names=adapter_names)
  File "...python3.9/site-packages/peft/tuners/lora/layer.py", line 420, in merge
    weight_norm = self._get_weight_norm(base_layer.weight, delta_weight, scaling=1).detach()
  File "...python3.9/site-packages/peft/tuners/lora/layer.py", line 176, in _get_weight_norm
    weight = weight + scaling * lora_weight
RuntimeError: The size of tensor a (768) must match the size of tensor b (2304) at non-singleton dimension 1

BenjaminBossan (Member, Author)

Thanks a lot, I added a fix and tests for merge_and_unload. If you could check again, that would be great.
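For reference, a rough sketch of the kind of regression check this adds for merge_and_unload with DoRA on a Conv1D model (the actual test added in the PR may differ):

import torch
import peft
import transformers

model = transformers.AutoModelForCausalLM.from_pretrained("gpt2")
model = peft.get_peft_model(model, peft.LoraConfig(r=4, use_dora=True))
model.eval()

input_ids = torch.tensor([[0, 1, 2, 3]])
with torch.no_grad():
    logits_before = model(input_ids).logits
    merged_model = model.merge_and_unload()
    logits_after = merged_model(input_ids).logits

# merging the DoRA weights into the base model should not change the output
assert torch.allclose(logits_before, logits_after, atol=1e-4)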

arash2060

Works on my end. Thank you!

younesbelkada (Collaborator) left a comment

Thanks @BenjaminBossan !

BenjaminBossan merged commit 26726bf into huggingface:main Apr 5, 2024
14 checks passed
BenjaminBossan deleted the fix-dora-with-conv1d branch April 5, 2024 13:26