adds sentiment example for a 20b model by edbeeching · Pull Request #208 · huggingface/trl

edbeeching · 2023-03-09T12:23:39Z

This PR adds a sentiment example for a 20b model.
There are 3 scripts:

s01_cm_finetune_peft_imdb.py - Fine tuning a Low Rank Adapter on a frozen 8-bit model for text generation on the imdb dataset.
s02_merge_peft_adapter.py - Merging of the adapter layers into the base model’s weights and storing these on the hub.
s03_gpt-neo-20b_sentiment_peft.py - Sentiment fine-tuning of a Low Rank Adapter to create positive reviews.

HuggingFaceDocBuilderDev · 2023-03-09T12:27:34Z

The documentation is not available anymore as the PR was closed or merged.

younesbelkada

Thanks a lot for this! Left few nits!
Let's also add a line about it on the docs 💪 Let me know if you want me to do that!

lvwerra

In general happy to merge. Two comments

left a nit about the naming
we should add it to the list of examples in the docs

…entiment_peft.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

…-neox-20b

younesbelkada

Thanks a lot for this!

Rinpas · 2023-04-08T02:35:30Z

Hi，there. I am a beginner in the field of NLP and I have been working with the GPT-J model recently. I came across your code for merging adapter layers into the base model's weights in s02_merge_peft_adapter.py , and I have some questions regarding the merging process.

From my understanding, after fine-tuning the model with the LORA layer and running this merging code, the LORA layer is replaced with a new randomly initialized linear layer. However, I did not see any indication in the code that the parameters of the LORA layer were inherited by this new linear layer.If this is the case, then it would mean that my previous training of only the LORA layer was pointless.

I would be grateful if you could provide me with some clarification on this matter. Thank you very much for your time and help.

younesbelkada · 2023-04-08T09:35:11Z

Hi @Kororinpas
Please have a look at this nice thread: #250 that goes over the details of merging LoRA layers!
Also now you can directly merge the lora layers using peft with the model.merge_and_unload() utility function, see huggingface/peft#227
Let us know if you have more questions after that

Rinpas · 2023-04-09T01:02:50Z

@younesbelkada Thank you for sharing this information with me. I have already checked out the thread you suggested and the original code. Now, my problem is solved. Thanks again!

* adds sentiment example for a 20b model * Update examples/sentiment/scripts/gpt-neox-20b_peft/s03_gpt-neo-20b_sentiment_peft.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update examples/sentiment/scripts/gpt-neox-20b_peft/s03_gpt-neo-20b_sentiment_peft.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update examples/sentiment/scripts/gpt-neox-20b_peft/s03_gpt-neo-20b_sentiment_peft.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update examples/sentiment/scripts/gpt-neox-20b_peft/s03_gpt-neo-20b_sentiment_peft.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * removed numbers from script names * adds examples to docs * cm -> clm * style --------- Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

adds sentiment example for a 20b model

1eb7e77

younesbelkada reviewed Mar 9, 2023

View reviewed changes

lvwerra reviewed Mar 9, 2023

View reviewed changes

Comment thread examples/sentiment/scripts/gpt-neox-20b_peft/README.md Outdated

edbeeching and others added 9 commits March 9, 2023 14:59

Update examples/sentiment/scripts/gpt-neox-20b_peft/s03_gpt-neo-20b_s…

3f3582f

…entiment_peft.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

Update examples/sentiment/scripts/gpt-neox-20b_peft/s03_gpt-neo-20b_s…

02ff55c

…entiment_peft.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

Update examples/sentiment/scripts/gpt-neox-20b_peft/s03_gpt-neo-20b_s…

ddddef1

…entiment_peft.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

Update examples/sentiment/scripts/gpt-neox-20b_peft/s03_gpt-neo-20b_s…

667dbfb

…entiment_peft.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

removed numbers from script names

7c030b0

adds examples to docs

342710f

Merge remote-tracking branch 'origin/peft-gpt-neox-20b' into peft-gpt…

fb2b019

…-neox-20b

cm -> clm

145ecaa

style

282bb60

lvwerra approved these changes Mar 9, 2023

View reviewed changes

younesbelkada approved these changes Mar 9, 2023

View reviewed changes

younesbelkada merged commit ddb6df3 into main Mar 9, 2023

younesbelkada deleted the peft-gpt-neox-20b branch March 9, 2023 14:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

adds sentiment example for a 20b model#208

adds sentiment example for a 20b model#208
younesbelkada merged 10 commits intomainfrom
peft-gpt-neox-20b

edbeeching commented Mar 9, 2023

Uh oh!

HuggingFaceDocBuilderDev commented Mar 9, 2023 •

edited

Loading

Uh oh!

younesbelkada left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

lvwerra left a comment

Uh oh!

Uh oh!

younesbelkada left a comment •

edited

Loading

Uh oh!

Rinpas commented Apr 8, 2023

Uh oh!

younesbelkada commented Apr 8, 2023 •

edited

Loading

Uh oh!

Rinpas commented Apr 9, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

edbeeching commented Mar 9, 2023

Uh oh!

HuggingFaceDocBuilderDev commented Mar 9, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

younesbelkada left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

lvwerra left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

younesbelkada left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Rinpas commented Apr 8, 2023

Uh oh!

younesbelkada commented Apr 8, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Rinpas commented Apr 9, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

HuggingFaceDocBuilderDev commented Mar 9, 2023 •

edited

Loading

younesbelkada left a comment •

edited

Loading

younesbelkada commented Apr 8, 2023 •

edited

Loading