Allow for dynamic batch padding #2352
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Thanks for adding this! Left a few comments. From ke's comments, it looks like the batch issue is linked to the tracing.
src/accelerate/inference.py
Outdated
process_index=0,
num_processes=state.num_processes,
)
extra = concatenate([extra] * ((found_batch_size % state.num_processes) + 1))
See the related comment in Slack.
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
We are practically there. Left a few comments.
old_size = tensor.shape
new_size = list(old_size)
new_size[0] = batch_size + to_pad
new_tensor = tensor.new_zeros(tuple(new_size))
It's okay to just pad with 0s; we can drop them afterwards, and ideally the user won't know that padded inputs were even sent.
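A rough sketch of that zero-padding-then-drop idea (illustrative only, assuming plain PyTorch tensors; pad_to_batch_size and the example shapes are made up for this comment, not the helper added in this PR):

import torch

def pad_to_batch_size(tensor: torch.Tensor, target_batch_size: int) -> torch.Tensor:
    # Pad `tensor` with zero rows along dim 0 until it reaches `target_batch_size`.
    batch_size = tensor.shape[0]
    to_pad = target_batch_size - batch_size
    if to_pad <= 0:
        return tensor
    new_size = list(tensor.shape)
    new_size[0] = target_batch_size
    padded = tensor.new_zeros(tuple(new_size))  # zeros keep the original dtype/device
    padded[:batch_size] = tensor                # real samples first, zero rows after
    return padded

# After the forward pass, slice the padding back off so the user only ever
# sees outputs for the inputs they actually sent.
x = torch.randn(3, 8)                 # 3 real samples, but suppose 4 are needed
padded = pad_to_batch_size(x, 4)      # shape (4, 8), last row is zeros
out = (padded * 2)[:3]                # stand-in for a forward pass, padding dropped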
* Broken version
* Timing I would expect
* Working version!
* Use MethodType
* working test
* Tests
* Use no split module classes explicitly
* Put split_points in pipelien
* Store split points in hf_split_points
* fix case num_process=1
* Allow for dynamic batch padding (#2352)
  * Allow for dynamic batch paddign
  * Fix test
  * Update src/accelerate/inference.py
    Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
  * Break early after the first valid bs is found
  * Less slicy-dicy
  * Test cv model
  * Start, need to test
  * Use dataloader-like logic
  * Refactor to utils
  * With tests
  * Update the source
  * Clean
  * bs=1 case
  * Add test
  * add some failing test
  * Almost working version
  * Much cleaner implementation
  * Use pad_input_tensor
  * All tests passing!
  * Do it at tracing too
  ---------
  Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
  Co-authored-by: Marc Sun <marc@huggingface.co>
* Rm literal
* Allow users to pass in max_memory
* Note about recursion
* Document, document, document
* Right import check
* Fix bug, add tests to multigpu runners
* Change default to None
* Start of docs
* Try again?
* Try again x2
* Trailing comma
* Move import
* Clean
* typehint
* typo
* From code review
* Use num_chunks
* Update tests/test_utils.py
  Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Bad copy/paste
* hf_split_points
---------
Co-authored-by: Marc Sun <marc@huggingface.co>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
What does this PR do?
This PR allows for dynamic batch padding by finding and duplicating the last item in the batch before concatenating everything. A rough sketch of the idea is shown below.
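Illustrative only; the PR itself goes through Accelerate's utilities (e.g. the concatenate call visible in the diff above), and the function and variable names below are made up:

import torch

def pad_batch_by_duplicating_last(batch: torch.Tensor, num_processes: int) -> torch.Tensor:
    # Repeat the last sample until the batch size divides evenly across processes.
    remainder = batch.shape[0] % num_processes
    if remainder == 0:
        return batch
    to_pad = num_processes - remainder
    extra = batch[-1:].repeat(to_pad, *([1] * (batch.dim() - 1)))  # copies of the last item
    return torch.cat([batch, extra], dim=0)

batch = torch.randn(5, 16)                        # 5 samples to split across 2 processes
padded = pad_batch_by_duplicating_last(batch, 2)  # shape (6, 16); row 5 duplicates row 4

The duplicated items can then be dropped from the outputs after gathering, so callers only see results for the samples they actually passed in.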
Current issue:
PiPPy has issues with batch inference on GPT-2; will talk to the PiPPy folks about this.
Fixes # (issue)
Before submitting
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.
@SunMarc