
Implement a batch_size parameter in the pipeline object #13141

Closed
xegulon opened this issue Aug 16, 2021 · 8 comments

Comments


xegulon commented Aug 16, 2021

🚀 Feature request

Implement a batch_size parameter in the pipeline object, so that when we call it, it computes the predictions in batches of sentences and does not run into CUDA out-of-memory errors.

Ideally, this optional argument would have a good default, computed from the tokenizer's parameters and the hardware the code is running on.

References to this need in the forum:
https://discuss.huggingface.co/t/how-to-make-pipeline-automatically-scale/7432/3
https://discuss.huggingface.co/t/how-to-change-the-batch-size-in-a-pipeline/8738
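
For illustration, a minimal sketch of how the requested argument might be used from the caller's side (the `batch_size` name and its behavior here are assumptions taken from this request, not the API as released at the time):

```python
# Hypothetical usage of the requested argument; `batch_size` and its behavior
# are assumptions from this feature request, not the released pipeline API.
from transformers import pipeline

classifier = pipeline("text-classification", device=0)  # assumes a GPU at index 0

sentences = ["I love this.", "This is terrible."] * 1000

# The idea: the pipeline splits the inputs into batches internally, instead of
# the caller chunking the list by hand to avoid running out of GPU memory.
predictions = classifier(sentences, batch_size=32)
```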

Motivation

When running inference on a very long list of sentences using the pipeline object, I often get CUDA OOM errors.

Your contribution

I could try :)


xegulon commented Aug 23, 2021

@sgugger ?

LysandreJik commented

Hello @xegulon, this is in line with some work currently underway by @Narsil.


Narsil commented Aug 23, 2021

@xegulon,

Batching at inference time is something to be very cautious about, because alignment (padding inputs to a common length) might heavily penalize the speed of inference.

See #11251 and https://gist.github.com/Narsil/ee5c09875e74fa6f018dc6d014f6c06c for more information.

CUDA OOM errors are most likely due to the fact that you are padding far too much, which also illustrates the slowdown.
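
As a concrete illustration of that padding cost (a minimal sketch; the tokenizer and inputs are examples, not taken from the thread):

```python
# Small illustration of how padding inflates a mixed batch (example is mine).
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("bert-base-uncased")
batch = ["short", "a " + "very " * 400 + "long sentence"]

# Batched together, the short input is padded up to the long one's length, so
# the model does (and stores) far more work than the short text requires.
encoded = tok(batch, padding=True, return_tensors="pt")
print(encoded["input_ids"].shape)  # roughly (2, ~406): the first row is almost all padding
```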

The big refactor mentioned by @LysandreJik is ready here: https://github.com/Narsil/transformers/tree/iterable_pipelines

With that PR, you should be able to actually stream all your data to the GPU, leading to a massive speedup (like DataLoader). If you want to batch because you know it will bring a speedup (please measure on real payloads; the gain is unlikely to be significant, so make sure it really is a speedup), you can do it manually using DataLoader, preprocess, forward, and postprocess.
The proposed PR will also use DataLoader (for pt) by default if you send lists, and you can pass Datasets directly.
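
A minimal sketch of the streaming pattern described above, assuming the refactored pipeline accepts iterable inputs as in the linked branch (the file name and task are placeholders):

```python
from transformers import pipeline

pipe = pipeline("text-classification", device=0)

def data():
    # Yield inputs one at a time so nothing forces the whole list to be
    # materialized and padded as one huge batch.
    with open("sentences.txt", encoding="utf-8") as f:
        for line in f:
            yield line.strip()

# With an iterable input the pipeline can keep the GPU fed continuously
# (DataLoader-style) instead of waiting on a single large, heavily padded batch.
for prediction in pipe(data()):
    print(prediction)
```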


xegulon commented Aug 26, 2021

Great (useful) work @Narsil, thanks a lot. Is it planned to be released in v4.10.0?


Narsil commented Aug 26, 2021

I don't think it will make it in time; it's a pretty massive change, and we're pulling things in bit by bit to make sure we're not breaking anything (we're currently in a phase of strengthening the tests first).

github-actions bot commented

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.


Narsil commented Sep 20, 2021

@xegulon, the modifications have landed in master. Can you confirm they speed up inference without the need for batch_size?
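
One way to check that on master might look like the following (a sketch; the timing harness and inputs are mine, not from the thread):

```python
import time
from transformers import pipeline

pipe = pipeline("text-classification", device=0)
sentences = ["An example sentence of moderate length."] * 2000

start = time.perf_counter()
# On master the pipeline is expected to stream a plain list through a
# DataLoader under the hood, so no explicit batch_size should be needed.
results = list(pipe(sentences))
print(f"{len(results)} predictions in {time.perf_counter() - start:.1f}s")
```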


github-actions bot commented Nov 8, 2021

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

Narsil closed this as completed Nov 8, 2021