[Feature Extractors] Return attention mask always in int32 #13543

patrickvonplaten · 2021-09-13T11:06:32Z

What does this PR do?

This PR fixes:

tests/test_modeling_tf_hubert.py::TFHubertModelIntegrationTest::test_inference_ctc_robust_batched
tests/test_modeling_tf_wav2vec2.py::TFWav2Vec2ModelIntegrationTest::test_inference_ctc_robust_batched

In some specific use cases, feature extractors returned the attention mask as type bool, which broke the two slow TF tests listed above. This PR makes sure the mask is always int32 or long, just like the tokenizers do for text.
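To illustrate the intended behaviour, here is a minimal NumPy sketch. It is not the code changed in this PR: the helper names `pad_with_int32_mask` and `ensure_int32_mask` are hypothetical and only show the idea of building (or casting) the attention mask as int32 rather than bool.

```python
import numpy as np


def pad_with_int32_mask(sequences, padding_value=0.0):
    """Hypothetical helper: pad 1-D float sequences and build an int32 mask."""
    max_len = max(len(seq) for seq in sequences)
    input_values = np.full((len(sequences), max_len), padding_value, dtype=np.float32)
    # Allocate the mask as int32 up front so it never ends up as bool.
    attention_mask = np.zeros((len(sequences), max_len), dtype=np.int32)
    for i, seq in enumerate(sequences):
        input_values[i, : len(seq)] = seq
        attention_mask[i, : len(seq)] = 1
    return {"input_values": input_values, "attention_mask": attention_mask}


def ensure_int32_mask(attention_mask):
    """Hypothetical helper: cast any mask (bool, int64, list) to int32."""
    return np.asarray(attention_mask).astype(np.int32)


batch = pad_with_int32_mask([[0.1, 0.2, 0.3], [0.4]])
print(batch["attention_mask"].dtype)
print(ensure_int32_mask(np.array([True, False])).dtype)
```

Casting at the point where the mask is returned keeps the dtype independent of how the mask was computed upstream.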

LysandreJik

LGTM, wdyt @Rocketknight1 ?

anton-l

Works like a charm! Since the tensor casting logic is not changed on the TF side of the library, this shouldn't cause any problems down the road.

Rocketknight1 · 2021-09-13T11:49:33Z

LGTM! I think lots of stuff breaks with boolean attention masks, so we definitely want to avoid sending that to the model.
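As one small illustration of the kind of breakage meant here (a NumPy sketch, not the actual failure seen in the TF tests): arithmetic that inverts a mask works on an int32 mask but raises on a boolean one.

```python
import numpy as np

bool_mask = np.array([[True, True, False]])
int_mask = bool_mask.astype(np.int32)

# Inverting the mask arithmetically works for the integer mask.
inverted = np.ones_like(int_mask) - int_mask

# The same expression on a boolean mask raises, because NumPy does not
# support the `-` operator between two boolean arrays.
try:
    np.ones_like(bool_mask) - bool_mask
    bool_subtract_failed = False
except TypeError:
    bool_subtract_failed = True

print(inverted.tolist(), bool_subtract_failed)
```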

return attention mask in int32

210e981

patrickvonplaten requested review from LysandreJik and anton-l September 13, 2021 11:13

Merge branch 'master' of https://github.com/huggingface/transformers into attn_mask_in_int32

52307f3

LysandreJik approved these changes Sep 13, 2021

anton-l approved these changes Sep 13, 2021

patrickvonplaten merged commit 5c14fce into huggingface:master Sep 13, 2021

patrickvonplaten deleted the attn_mask_in_int32 branch September 13, 2021 12:03

Albertobegue pushed a commit to Albertobegue/transformers that referenced this pull request Jan 13, 2022

return attention mask in int32 (huggingface#13543)

5515d39
