Attention mask is important in the case of batching... #16222
Conversation
The documentation is not available anymore as the PR was closed or merged.
src/transformers/pipelines/base.py
In random models, special_tokens_mask would be extended in the batch with 0 instead of 1, so we could still end up predicting the PAD token in the pipeline.
I think having pad always be marked in special_tokens_mask is fine.
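For context, a minimal sketch of padding a batch by hand (not the actual pipeline code; `pad_batch` is a made-up helper) showing why the fill values matter for each tensor:

```python
import torch

# Hypothetical helper, for illustration only: pad a list of encodings to a common length.
def pad_batch(encodings, pad_token_id):
    max_len = max(e["input_ids"].shape[0] for e in encodings)

    def pad(tensor, value):
        return torch.nn.functional.pad(tensor, (0, max_len - tensor.shape[0]), value=value)

    return {
        # Pad positions get the tokenizer's pad_token_id.
        "input_ids": torch.stack([pad(e["input_ids"], pad_token_id) for e in encodings]),
        # Filled with 0 so the model does not attend to pad positions.
        "attention_mask": torch.stack([pad(e["attention_mask"], 0) for e in encodings]),
        # Filled with 1 (not 0) so pad positions count as special tokens and are never predicted on.
        "special_tokens_mask": torch.stack([pad(e["special_tokens_mask"], 1) for e in encodings]),
    }
```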
return_attention_mask=True is also incorrect because FNet doesn't expect an attention mask.
Maybe FNet will continue to exhibit the flaw where pad tokens modify the output; I don't know enough about it, though.
You will thus get the attention mask since you don't remove it afterward, but I'm guessing that's the whole point?
Actually, it seems the FNet tokenizer doesn't return an attention mask if we don't ask for it (which is fair, since the model doesn't seem to accept one).
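A quick way to check this (just a sketch; it assumes the google/fnet-base checkpoint, which is not part of this PR):

```python
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("google/fnet-base")
enc = tok(["short", "a noticeably longer sentence"], padding=True)
# Expect only input_ids and token_type_ids here; attention_mask only shows up
# if return_attention_mask=True is passed explicitly.
print(enc.keys())
```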
sgugger left a comment
LGTM, thanks for fixing!
Force-pushed from a6bf0fc to e0bc450.
* Attention mask is important in the case of batching...
* Improve the fix.
* Making the sentences different enough that they exhibit different predictions.
What does this PR do?
Fixes #16221
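For readers landing here from the issue, a hedged sketch of the symptom (the checkpoint is only an example and is not taken from this PR): batching a short sentence with a longer one should not change its logits, but without an attention mask the pad tokens leak into the computation.

```python
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

name = "distilbert-base-uncased-finetuned-sst-2-english"  # example checkpoint
tok = AutoTokenizer.from_pretrained(name)
model = AutoModelForSequenceClassification.from_pretrained(name).eval()

alone = tok("I love this!", return_tensors="pt")
batch = tok(
    ["I love this!", "This is a much longer sentence that forces the first one to be padded."],
    padding=True,
    return_tensors="pt",
)

with torch.no_grad():
    ref = model(**alone).logits[0]
    with_mask = model(input_ids=batch["input_ids"], attention_mask=batch["attention_mask"]).logits[0]
    without_mask = model(input_ids=batch["input_ids"]).logits[0]

print(torch.allclose(ref, with_mask, atol=1e-4))     # True: pad positions are masked out
print(torch.allclose(ref, without_mask, atol=1e-4))  # typically False: pads changed the output
```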
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.