
Generate: PT's top_p enforces min_tokens_to_keep when it is 1 #24111

Merged: gante merged 2 commits into huggingface:main from top_p_min_tokens on Jun 9, 2023

Conversation

gante (Member) commented on Jun 8, 2023:

What does this PR do?

Fixes #23688

Contrary to the description in the docstring, PT's top_p was not enforcing min_tokens_to_keep when it was 1 (the default); TF and FLAX were fine. This PR corrects that and adds a validation check on min_tokens_to_keep (it must be a non-negative integer).

gante requested a review from amyeroberts on June 8, 2023 at 13:36
HuggingFaceDocBuilderDev commented on Jun 8, 2023:

The documentation is not available anymore as the PR was closed or merged.

amyeroberts (Collaborator) commented:

> top_p was not enforcing min_tokens_to_keep when it was 1

From the diff, I don't see how this is resolved. The checks validate the value of min_tokens_to_keep but don't seem to be conditional on top_p. Am I missing something?

@@ -266,9 +268,8 @@ def __call__(self, input_ids: torch.LongTensor, scores: torch.FloatTensor) -> to
      # Remove tokens with cumulative top_p above the threshold (token with 0 are kept)
      sorted_indices_to_remove = cumulative_probs <= (1 - self.top_p)
-     if self.min_tokens_to_keep > 1:
gante (Member Author) replied:
@amyeroberts -- this line was preventing the application of min_tokens_to_keep when it was 1. Removing it sorts the problem :)
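
For context, the core of the warper after this change looks roughly like the standalone sketch below. The function name and the assumption of 2-D (batch, vocab) scores are mine for illustration; the point is that the "keep at least min_tokens_to_keep tokens" line now runs unconditionally instead of being guarded by `if self.min_tokens_to_keep > 1:`.

```python
import torch


def top_p_warp(
    scores: torch.FloatTensor,
    top_p: float,
    min_tokens_to_keep: int = 1,
    filter_value: float = -float("inf"),
) -> torch.FloatTensor:
    # Sort ascending so the cumulative sum runs from the least to the most probable token.
    sorted_logits, sorted_indices = torch.sort(scores, descending=False)
    cumulative_probs = sorted_logits.softmax(dim=-1).cumsum(dim=-1)

    # Remove tokens whose cumulative probability falls below the top_p threshold.
    sorted_indices_to_remove = cumulative_probs <= (1 - top_p)

    # The fix: always keep at least `min_tokens_to_keep` tokens, even when it is 1.
    sorted_indices_to_remove[..., -min_tokens_to_keep:] = 0

    # Scatter the sorted mask back to the original vocabulary order and mask the scores.
    indices_to_remove = sorted_indices_to_remove.scatter(1, sorted_indices, sorted_indices_to_remove)
    return scores.masked_fill(indices_to_remove, filter_value)
```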

gante (Member Author) added:
e.g. if we were to set top_p=0.0, .generate() would crash due to the lack of suitable continuations, despite the default of min_tokens_to_keep being 1.

After this fix, setting top_p=0.0 no longer crashes the code. Refresher: top_p keeps the smallest set of most probable tokens whose cumulative probability reaches top_p, so setting it to 0.0 means it should pick exactly one token.
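
As a toy illustration of the behaviour described above (made-up logits over a 5-token vocabulary; the dummy input_ids tensor only exists to satisfy the call signature), something along these lines now keeps exactly one token instead of crashing:

```python
import torch
from transformers import TopPLogitsWarper

# Toy distribution over a 5-token vocabulary (batch size 1).
scores = torch.tensor([[0.10, 0.40, 0.30, 0.15, 0.05]]).log()
input_ids = torch.tensor([[0]])  # not used by the warper, only needed for the signature

warper = TopPLogitsWarper(top_p=0.0)  # min_tokens_to_keep defaults to 1
filtered = warper(input_ids, scores)

# With the fix, only the most probable token (index 1) survives; before it,
# every token was masked to -inf and sampling crashed.
print((filtered != -float("inf")).nonzero())  # tensor([[0, 1]])
```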

amyeroberts (Collaborator) replied:
Ah, it's a stupid ambiguity of English issue :D From the title I thought it meant 'when top_p is 1'.

amyeroberts (Collaborator) left a review comment:

Thanks for fixing!

gante merged commit be10092 into huggingface:main on Jun 9, 2023
22 checks passed
gante deleted the top_p_min_tokens branch on June 9, 2023 at 12:20
njhill (Contributor) commented on Jun 21, 2023:

@gante I hit an issue related to this in the prior version of transformers; glad to see that it's fixed, thanks! However, why don't we enforce min_tokens_to_keep >= 1? A value of 0 makes no sense, right?

gante (Member Author) commented on Jun 22, 2023:

@njhill True, the check should indeed be against >= 1; patching it.
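
For reference, the tightened check described here would look roughly like this sketch (the exact wording of the error message in transformers may differ):

```python
# Inside the warper's __init__ (sketch): require a positive integer rather than
# a merely non-negative one, since keeping 0 tokens makes no sense.
if not isinstance(min_tokens_to_keep, int) or min_tokens_to_keep < 1:
    raise ValueError(f"`min_tokens_to_keep` has to be a positive integer, but is {min_tokens_to_keep}")
```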

novice03 pushed a commit to novice03/transformers that referenced this pull request Jun 23, 2023
OlivierDehaene pushed a commit to huggingface/text-generation-inference that referenced this pull request Jul 4, 2023
See huggingface/transformers#24111

I didn't add validation to the `__init__` method since it's not done for
other values/warpers.
Development

Successfully merging this pull request may close these issues.

LlamaForCausalLM generate() runtime error when top_p=0