
fixing prefix_allowed_tokens_fn #3276

Conversation

nicola-decao
Contributor

Before submitting

  • Was this discussed/approved via a GitHub issue? (no need for typos, doc improvements)
  • Did you read the contributor guideline?
  • Did you make sure to update the docs?
  • Did you write any new necessary tests?

What does this PR do?

Fixes the use of `prefix_allowed_tokens_fn` in generation. It was working in `fairseq==0.9.0` (see https://github.com/facebookresearch/GENRE) but is broken in the current version.

PR review

Anyone in the community is free to review the PR once the tests have passed.

Did you have fun?

Make sure you had fun coding 🙃

@facebook-github-bot
Contributor

@myleott has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@erip
Contributor

erip commented Mar 1, 2021

I'm interested in seeing this land, but I was curious if it would be possible to also include a bit of documentation about this prefix_allowed_tokens_fn callable. I can't seem to find anything which explains what it's supposed to do or what shape it's supposed to be (f: (Tensor, list[int]) -> int?)

@nicola-decao
Contributor Author

> I'm interested in seeing this land, but I was curious if it would be possible to also include a bit of documentation about this prefix_allowed_tokens_fn callable. I can't seem to find anything which explains what it's supposed to do or what shape it's supposed to be (f: (Tensor, list[int]) -> int?)

@erip you are right. Where do you think is the best place to document the function's signature? Here? https://github.com/pytorch/fairseq/blob/e5e8b3fee1e57a7abf35ad1a3ff223a2b7190c65/fairseq/search.py#L148

@erip
Contributor

erip commented Mar 1, 2021

@nicola-decao I think that makes good sense. There doesn't seem to be any documentation on the other search strategies, but this one is somewhat less straightforward since it's got the callback. Unless @myleott has other thoughts, I think throwing a docstring beneath the ctor would be great.

@myleott
Contributor

myleott commented Mar 1, 2021

@nicola-decao if you can share a docstring here, I can update the imported version before merging

@nicola-decao
Contributor Author

> @nicola-decao if you can share a docstring here, I can update the imported version before merging

@myleott Here you go:

`prefix_allowed_tokens_fn` (`Callable[[int, torch.Tensor], List[int]]`): If provided, this function constrains the beam search to allowed tokens only at each step. If not provided, no constraint is applied. The function takes two arguments: the batch ID `batch_id: int` and a one-dimensional tensor of token IDs `input_ids: torch.Tensor`. It has to return a `List[int]` with the allowed tokens for the next generation step, conditioned on the previously generated tokens `input_ids` and the batch ID `batch_id`. This argument is useful for constrained generation conditioned on a prefix, as described in Autoregressive Entity Retrieval (https://arxiv.org/abs/2010.00904) and implemented in https://github.com/facebookresearch/GENRE.
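For concreteness, here is a minimal sketch of a function matching that contract; it is not part of this PR, and the vocabulary size and forced prefix are made up for illustration. It allows only the next token of a fixed prefix until the prefix is consumed, then leaves generation unconstrained:

```python
from typing import List

import torch

VOCAB_SIZE = 32000             # hypothetical vocabulary size
FORCED_PREFIX = [42, 7, 1337]  # hypothetical token IDs the output must start with


def prefix_allowed_tokens_fn(batch_id: int, input_ids: torch.Tensor) -> List[int]:
    """Allow only the next forced-prefix token until the prefix is consumed,
    then allow the full vocabulary."""
    step = input_ids.shape[0]  # number of tokens generated so far for this hypothesis
    if step < len(FORCED_PREFIX):
        return [FORCED_PREFIX[step]]  # exactly one allowed continuation
    return list(range(VOCAB_SIZE))    # no constraint past the prefix
```

Note that, depending on the model, `input_ids` may begin with a BOS/EOS marker, so the step offset may need adjusting in practice. GENRE builds this function from a prefix trie of valid entity names rather than a single fixed prefix.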

@nicola-decao
Contributor Author

@myleott Any news on this? Is there something I should do?

@facebook-github-bot
Contributor

@myleott has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@nicola-decao
Contributor Author

nicola-decao commented Apr 5, 2021

@myleott Any news on this? I have a Facebook AI project, https://github.com/facebookresearch/GENRE, that depends on this bug fix (for now I link people to my fork with the fix, which is not ideal).

@nicola-decao
Contributor Author

@myleott @sshleifer can we please proceed with the merge here? It is really a minor change.

There is a Facebook AI project, https://github.com/facebookresearch/GENRE, that depends on this bug fix (for now I link people to my fork with the fix, which is not ideal, and they may have trouble installing it).

@erip
Contributor

erip commented May 26, 2021

also cc @alexeib

facebook-github-bot pushed a commit that referenced this pull request May 27, 2021
Summary:
# Before submitting

- [ ] Was this discussed/approved via a GitHub issue? (no need for typos, doc improvements)
- [x] Did you read the [contributor guideline](https://github.com/pytorch/fairseq/blob/master/CONTRIBUTING.md)?
- [x] Did you make sure to update the docs?
- [x] Did you write any new necessary tests?

## What does this PR do?
Fixes the use of `prefix_allowed_tokens_fn` in generation. It was working in `fairseq==0.9.0` (see https://github.com/facebookresearch/GENRE) but is broken in the current version.

## PR review
Anyone in the community is free to review the PR once the tests have passed.

## Did you have fun?
Make sure you had fun coding 🙃

Pull Request resolved: #3276

Reviewed By: alexeib

Differential Revision: D26725494

Pulled By: myleott

fbshipit-source-id: ce3da725f36352687e5cb5d62a59b4c89ce0b0bc