

Partial implementation of LQ-LoRA #8324

Open · wants to merge 1 commit into develop

Conversation

@Liebele commented Apr 24, 2024

PR types

PR changes

Description

@CLAassistant
CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
You have signed the CLA already but the status is still pending? Let us recheck it.

paddle-bot bot commented Apr 24, 2024

Thanks for your contribution!

codecov bot commented Apr 24, 2024

Codecov Report

Attention: Patch coverage is 0%, with 41 lines in your changes missing coverage. Please review.

Project coverage is 55.23%. Comparing base (277f45b) to head (33e2fbd).
Report is 3 commits behind head on develop.

Files                                 Patch %   Lines
paddlenlp/peft/lora/lqlora_utils.py   0.00%     41 Missing ⚠️
Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #8324      +/-   ##
===========================================
- Coverage    55.25%   55.23%   -0.03%     
===========================================
  Files          613      614       +1     
  Lines        95626    95667      +41     
===========================================
  Hits         52837    52837              
- Misses       42789    42830      +41     

☔ View full report in Codecov by Sentry.

@ZHUI ZHUI requested a review from lugimzzz April 25, 2024 06:12
@Liebele (Author) commented Apr 26, 2024

The algorithm is implemented on the following principle:
[image: algorithm illustration]

lora_A = Ur @ paddle.diag(paddle.sqrt(Sr))
lora_B = paddle.diag(paddle.sqrt(Sr)) @ Vhr

Q = qlora_weight_quantize_dequantize(W - lora_A @ lora_B, double_quant=True)
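For context, a minimal self-contained sketch of this initialization. The function name lqlora_init, the num_iters loop, and the import path for qlora_weight_quantize_dequantize are assumptions for illustration, not the PR's actual API; the alternating loop follows the LQ-LoRA paper, while the fragments above suggest the PR may perform a single pass.

import paddle
# Assumed import path; the PR adds its helper in paddlenlp/peft/lora/lqlora_utils.py.
from paddlenlp.quantization.qlora import qlora_weight_quantize_dequantize

def lqlora_init(W, num_ranks, num_iters=10):
    """Split W into a quantized residual Q plus a low-rank pair, so W ≈ Q + lora_A @ lora_B."""
    W = paddle.cast(W, paddle.float32)  # SVD runs in fp32
    Q = paddle.zeros_like(W)
    for _ in range(num_iters):
        # Low-rank step: best rank-r approximation of the residual W - Q.
        U, S, Vh = paddle.linalg.svd(W - Q, full_matrices=False)
        sqrt_S = paddle.diag(paddle.sqrt(S[:num_ranks]))
        lora_A = U[:, :num_ranks] @ sqrt_S
        lora_B = sqrt_S @ Vh[:num_ranks]
        # Quantization step: quantize-dequantize what the low-rank part misses.
        Q = qlora_weight_quantize_dequantize(W - lora_A @ lora_B, double_quant=True)
    return lora_A, lora_B, Q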
Contributor commented:

double_quant=True should be exposed as a tunable parameter, and the same goes for the other arguments of qlora_weight_quantize_dequantize.
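For illustration, one way to surface those knobs (a hypothetical wrapper; the keyword names are assumptions modeled on QLoRA-style options, not the PR's actual signature):

# Hypothetical: thread the quantization options through instead of hard-coding them.
def lqlora_quantize_residual(W, lora_A, lora_B, **qlora_kwargs):
    # qlora_kwargs might carry double_quant, block_size,
    # double_quant_block_size, quant_algo, ...
    return qlora_weight_quantize_dequantize(W - lora_A @ lora_B, **qlora_kwargs)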

Sr = S[:num_ranks]
Vhr = Vh[:num_ranks]

lora_A = Ur @ paddle.diag(paddle.sqrt(Sr))
Contributor commented:

The configuration needs to take LoRA scaling into account; with this initialization, it looks like the LoRA scaling can only be forced to 1.
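For context, my reading of the concern (not a statement from the PR): standard LoRA computes W_eff = Q + s * lora_A @ lora_B with s = lora_alpha / r, while the SVD initialization above guarantees W ≈ Q + lora_A @ lora_B, so the reconstruction is only faithful when s = 1. A small runnable check:

import paddle

paddle.seed(0)
W = paddle.randn([64, 64])
U, S, Vh = paddle.linalg.svd(W, full_matrices=False)
r = 8
sqrt_S = paddle.diag(paddle.sqrt(S[:r]))
lora_A = U[:, :r] @ sqrt_S
lora_B = sqrt_S @ Vh[:r]
Q = W - lora_A @ lora_B  # stand-in for the quantized residual

for scaling in (1.0, 2.0):
    err = paddle.linalg.norm(W - (Q + scaling * (lora_A @ lora_B)))
    print(scaling, float(err))  # ~0 only when scaling == 1.0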


if W.dtype in [paddle.float16]:
    old_dtype = W.dtype
    W = paddle.cast(W, dtype=paddle.float32)
Contributor commented:

What is the reason for casting to fp32?


@lugimzzz (Contributor) commented:

Are there any experimental results we could look at to assess the effect?

@lugimzzz (Contributor) commented Apr 29, 2024

Before submitting, please fix the formatting issues:

cd PaddleNLP
pre-commit install
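Once installed, the hooks run on every git commit; they can also be invoked manually to fix up files already changed in this PR:

pre-commit run --all-files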

@Liebele (Author) commented May 6, 2024

Fine-tuning results on the E2E dataset:
[image: fine-tuning results]
