add minibatching by lvwerra · Pull Request #153 · huggingface/trl

lvwerra · 2023-02-16T14:20:08Z

Until now the PPO mini batch size has been hardcoded to 1. This PR aims to change it by refactoring the forward/backward passing logic.

In summary this PR does the following things:

The batched_forward_pass returns new a mask which can be used to mask parts of the sequence to be ignored
enable mini-batching of PPO by creating a small dataloader with the mini_batch_size to sample from the current PPO batch
In the loss method we replace all operations affected by masked parts of the sequence with masked ones (masked_mean, masked_whiten)
remove compute_logits_vpred and use batched_forward_pass for everything
extend testing and refactor it (i don't think we need subfolders for the 3 test files we have)

W&B logs:

HuggingFaceDocBuilderDev · 2023-02-21T17:07:09Z

The documentation is not available anymore as the PR was closed or merged.

younesbelkada

Thanks a lot for this great addition! I left few comments and questions as a first pass!

younesbelkada · 2023-02-21T18:49:36Z

+            mini_batch_data,
+            batch_size=self.config.mini_batch_size,
+            shuffle=True,
+            collate_fn=collator,


Suggested change

collate_fn=collator,

collate_fn=collator,

drop_last=True,

Maybe we can add this to avoid some corner-cases such as the one described on a previous issue

Sounds good, let's also set a warning if that's the case so the user knows that a batch will be dropped.

younesbelkada · 2023-02-21T18:54:15Z

-        bs = self.config.batch_size
-        fbs = self.config.forward_batch_size
+        bs = len(queries)
+        fbs = min(bs, self.config.forward_batch_size)


So this is the case where the last element has less instances than the mini_batch_size or the case a users put a batch_size that is smaller than mini_batch_size on the config? If it's the second case we can maybe add a warning on the config, if the first case since we have drop_last=True set here I don't think we'll face this case but I am not sure

It's for the case where mini_batch_size is smaller than forward_batch_size during the forward passes inside the minibatch loop. I am also not quite happy with how we do it actually.

younesbelkada

Also, what about completely removing forward_batch_size from the config? I don't think this is a breaking change as the configs cannot be pushed on the Hub, just need to update the examples accodingly. I believe this can be done on a follow up PR too

lvwerra · 2023-02-22T10:59:03Z

The breaking change actually also happens for users who currently use the library with forward_batch_size. What do you think about setting it default to None and overwrite mini_batch_size if it's set to another value with a warning that it affects now also the mini_batch_size if set to a value?

younesbelkada · 2023-02-22T10:59:52Z

This solution makes a lot of sense yes!

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

review-notebook-app · 2023-02-22T11:35:55Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

lvwerra · 2023-02-22T11:36:46Z

Deprecated forward_batch_size: feel free to have a look!

younesbelkada

Thanks a lot for your great work on this! 💯

* add minibatching * all the fixes i missed * ore fixes * add dedicated variable for mini batch size * style * minor fixes * fix rewards * unbiased variance estimation * mask values/returns * moar fixes * style * change structure and add moar tests * Apply suggestions from code review Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * deprecate `forward_batch_size` * remove out of date warning about batching s2s and left padding models * make style * fixed failed merge --------- Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

leandro and others added 12 commits February 16, 2023 15:19

add minibatching

af2ccdc

all the fixes i missed

a70f8a5

ore fixes

1bda802

add dedicated variable for mini batch size

0d5e478

style

8f7eacb

minor fixes

bcff3a5

fix rewards

847930f

unbiased variance estimation

c6a5673

mask values/returns

6ac2742

moar fixes

ab9dcce

style

c504a27

change structure and add moar tests

9634b1d

lvwerra marked this pull request as ready for review February 21, 2023 17:01

Merge branch 'main' into mini-batching

e5d1030

lvwerra requested review from edbeeching, natolambert and younesbelkada February 21, 2023 17:13

younesbelkada reviewed Feb 21, 2023

View reviewed changes

lvwerra and others added 4 commits February 22, 2023 12:01

Apply suggestions from code review

aa05758

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

deprecate forward_batch_size

2859989

remove out of date warning about batching s2s and left padding models

475bf2e

Merge remote-tracking branch 'origin/mini-batching' into mini-batching

ed66bd4

lvwerra requested a review from younesbelkada February 22, 2023 11:36

make style

8bb1936

younesbelkada approved these changes Feb 22, 2023

View reviewed changes

lvwerra and others added 2 commits February 22, 2023 14:35

Merge branch 'main' into mini-batching

bea7408

fixed failed merge

d6a2fe5

lvwerra merged commit f1300ec into main Feb 23, 2023

lvwerra deleted the mini-batching branch February 23, 2023 14:24

raj47212 mentioned this pull request Feb 25, 2023

minibatching changes and masking #176

Closed

Conversation

lvwerra commented Feb 16, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Feb 21, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

younesbelkada left a comment

Choose a reason for hiding this comment

Uh oh!

younesbelkada Feb 21, 2023

Choose a reason for hiding this comment

Uh oh!

lvwerra Feb 22, 2023

Choose a reason for hiding this comment

Uh oh!

Uh oh!

younesbelkada Feb 21, 2023

Choose a reason for hiding this comment

Uh oh!

lvwerra Feb 22, 2023

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

younesbelkada left a comment

Choose a reason for hiding this comment

Uh oh!

lvwerra commented Feb 22, 2023

Uh oh!

younesbelkada commented Feb 22, 2023

Uh oh!

review-notebook-app bot commented Feb 22, 2023

Uh oh!

lvwerra commented Feb 22, 2023

Uh oh!

younesbelkada left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

lvwerra commented Feb 16, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Feb 21, 2023 •

edited

Loading