add llama & qwen dpo #8474

lugimzzz · 2024-05-21T12:08:29Z

PR types

New features

PR changes

APIs

Description

新增dpo

paddle-bot · 2024-05-21T12:08:34Z

Thanks for your contribution!

codecov · 2024-05-21T12:40:56Z

Codecov Report

Attention: Patch coverage is 8.24176% with 501 lines in your changes are missing coverage. Please review.

Project coverage is 54.02%. Comparing base (b36b6a0) to head (affd27d).
Report is 5 commits behind head on develop.

❗ Current head affd27d differs from pull request most recent head 2854cf1

Please upload reports for the commit 2854cf1 to get more accurate results.

Files	Patch %	Lines
paddlenlp/trl/dpo_trainer.py	7.85%	352 Missing ⚠️
paddlenlp/trl/trl_data.py	3.66%	105 Missing ⚠️
paddlenlp/datasets/zero_padding_dataset.py	17.85%	23 Missing ⚠️
paddlenlp/trl/trl_utils.py	4.54%	21 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #8474      +/-   ##
===========================================
- Coverage    54.29%   54.02%   -0.27%     
===========================================
  Files          617      621       +4     
  Lines        96339    96878     +539     
===========================================
+ Hits         52310    52343      +33     
- Misses       44029    44535     +506

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

wawltor · 2024-05-22T08:46:35Z

llm/utils.py

@@ -211,13 +211,6 @@ def prediction_step(
            # keepdim in order to maintain the same shape as logits
            if isinstance(logits, (list, tuple)):
                logits = logits[0]
-            # all gather logits when enabling tensor_parallel_output


这里删除tensor parallel output的支持的原因是什么？

删错了，需要恢复

wawltor

LGTM

add llama&qwen dpo

47ef9c0

lugimzzz added 2 commits May 22, 2024 11:00

add

e7912b9

Merge branch 'dpo' of https://github.com/lugimzzz/PaddleNLP into dpo

bfaa91e

wawltor reviewed May 22, 2024

View reviewed changes

lugimzzz added 3 commits May 22, 2024 17:06

add dpo

233b894

fix bug

affd27d

add

2854cf1

wawltor approved these changes Jun 11, 2024

View reviewed changes

wawltor merged commit 909be01 into PaddlePaddle:develop Jun 11, 2024
8 of 10 checks passed

lugimzzz deleted the dpo branch June 27, 2024 07:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add llama & qwen dpo #8474

add llama & qwen dpo #8474

lugimzzz commented May 21, 2024

paddle-bot bot commented May 21, 2024

codecov bot commented May 21, 2024 •

edited

Loading

wawltor May 22, 2024

lugimzzz May 22, 2024

wawltor left a comment

add llama & qwen dpo #8474

add llama & qwen dpo #8474

Conversation

lugimzzz commented May 21, 2024

PR types

PR changes

Description

paddle-bot bot commented May 21, 2024

codecov bot commented May 21, 2024 • edited Loading

Codecov Report

wawltor May 22, 2024

Choose a reason for hiding this comment

lugimzzz May 22, 2024

Choose a reason for hiding this comment

wawltor left a comment

Choose a reason for hiding this comment

codecov bot commented May 21, 2024 •

edited

Loading