Adding support for multiple mask tokens. #14716
Conversation
- Original implem: huggingface#10222 Co-authored-by: njafer <naveen.jafer@oracle.com>
In order to accommodate optionally multimodal models like Perceiver, we add information to the tasks to specify the tasks where we know for sure whether the tokenizer/feature_extractor is needed or not.
NO_FEATURE_EXTRACTOR_TASKS = set()
NO_TOKENIZER_TASKS = set()
for task, values in SUPPORTED_TASKS.items():
    if values["type"] == "text":
        NO_FEATURE_EXTRACTOR_TASKS.add(task)
    elif values["type"] in {"audio", "image"}:
        NO_TOKENIZER_TASKS.add(task)
    elif values["type"] != "multimodal":
        raise ValueError(f"SUPPORTED_TASK {task} contains invalid type {values['type']}")
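As an illustrative sketch (the real `SUPPORTED_TASKS` registry lives in `transformers.pipelines` and is much larger; the toy registry below is hypothetical), the classification loop above partitions tasks by modality like this:

```python
# Hypothetical, minimal stand-in for the real SUPPORTED_TASKS registry.
SUPPORTED_TASKS = {
    "fill-mask": {"type": "text"},
    "image-classification": {"type": "image"},
    "automatic-speech-recognition": {"type": "audio"},
    "visual-question-answering": {"type": "multimodal"},
}

NO_FEATURE_EXTRACTOR_TASKS = set()
NO_TOKENIZER_TASKS = set()
for task, values in SUPPORTED_TASKS.items():
    if values["type"] == "text":
        # Pure text tasks never need a feature extractor.
        NO_FEATURE_EXTRACTOR_TASKS.add(task)
    elif values["type"] in {"audio", "image"}:
        # Pure audio/image tasks never need a tokenizer.
        NO_TOKENIZER_TASKS.add(task)
    elif values["type"] != "multimodal":
        raise ValueError(f"SUPPORTED_TASK {task} contains invalid type {values['type']}")

print(sorted(NO_FEATURE_EXTRACTOR_TASKS))  # text-only tasks
print(sorted(NO_TOKENIZER_TASKS))          # audio/image-only tasks
```

Multimodal tasks end up in neither set, so the pipeline keeps both preprocessors available for them.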
I like this approach, it should make the pipelines more robust to models with different capabilities in terms of preprocessors.
Looks good to me as well!
@@ -340,3 +338,27 @@ def fill_mask_with_duplicate_targets_and_top_k(self, model, tokenizer):
        # The target list contains duplicates, so we can't output more
        # than them
        self.assertEqual(len(outputs), 3)

    def fill_mask_with_multiple_masks(self, model, tokenizer):
Can we perhaps add a test for Perceiver (similar to the image classification models)?
Or is this not required here?
I think this PR is pretty orthogonal to Perceiver.
We could add a slow test for sure, but it doesn't have to be Perceiver specific.
In fact, I'll add something on the random model (it just needs to be consistent, actual values are less important)
Ok this looks good to me! Looking forward to the additional test, feel free to merge whenever.
* Adding support for multiple mask tokens. Original implem: huggingface#10222. Co-authored-by: njafer <naveen.jafer@oracle.com>
* In order to accommodate optionally multimodal models like Perceiver, we add information to the tasks to specify the tasks where we know for sure whether the tokenizer/feature_extractor is needed or not.
* Adding info in the documentation about multi masks, marked as experimental.
* Add a copy() to prevent overriding the same tensor over and over.
* Fixup.
* Adding small test for multi mask with real values.

Co-authored-by: njafer <naveen.jafer@oracle.com>
What does this PR do?
When presented with multiple masks, it's impossible to retrieve the joint probabilities. Instead of trying to work around that (see discussions in the previous PR), this PR simply outputs the raw top_k propositions at each mask locus: finding a good proxy for "joint probabilities" gets tricky, so rather than trying to solve this impossible problem we show exactly what the model outputs.
@naveenjafer is mentioned as co-author since much of this PR was pulled from there.
This PR was resurrected partly because Perceiver (a byte-level model) needs this type of masking to be useful.
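To illustrate the per-locus behaviour described above (not code from this PR; the vocabulary, logits, and helper name below are all hypothetical), each mask position gets its own independent softmax and top_k ranking, and the pipeline-style output is one candidate list per mask rather than a single joint ranking:

```python
import math


def topk_per_mask(logits_per_mask, vocab, k=2):
    """For each mask position, independently return the top-k tokens
    with softmax scores. No joint probability across masks is computed."""
    results = []
    for logits in logits_per_mask:
        # Softmax over the toy vocabulary for this mask position.
        exps = [math.exp(x) for x in logits]
        total = sum(exps)
        scores = [e / total for e in exps]
        ranked = sorted(zip(vocab, scores), key=lambda p: p[1], reverse=True)
        results.append([{"token_str": t, "score": round(s, 4)} for t, s in ranked[:k]])
    return results


# Toy vocabulary and logits for two mask positions (illustrative values only).
vocab = ["Paris", "Lyon", "cat"]
logits_per_mask = [[3.0, 1.0, 0.1], [0.2, 2.5, 2.0]]
print(topk_per_mask(logits_per_mask, vocab))
```

The nested-list shape (one inner list of `top_k` candidates per mask) mirrors what the fill-mask pipeline returns for a multi-mask input, while a single-mask input keeps the flat single-list output.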
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.