
[Generation] Allow inputs_embeds as an input #14443

Conversation

@patrickvonplaten (Contributor) commented Nov 18, 2021

What does this PR do?

This PR allows inputs_embeds to be used as an input argument for generate(). Fixes: #12218
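
A minimal usage sketch of what this enables for an encoder-decoder model (checkpoint name, prompt, and decoding settings are purely illustrative, not part of the PR):

from transformers import T5ForConditionalGeneration, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small")

input_ids = tokenizer("translate English to German: Hello there", return_tensors="pt").input_ids
# Look up the embeddings manually instead of passing token ids.
inputs_embeds = model.get_input_embeddings()(input_ids)

# With this PR, `generate()` can consume the embeddings directly for encoder-decoder models.
output_ids = model.generate(inputs_embeds=inputs_embeds, max_length=20)
print(tokenizer.batch_decode(output_ids, skip_special_tokens=True))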

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a Github issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

inputs_embeds = model.get_input_embeddings()(input_ids)

# cannot generate from `inputs_embeds` for decoder only
with pytest.raises(ValueError):
Contributor (Author):

Decoder-only models can't generate from inputs_embeds since the predicted ids are appended to the starting ids, so it's assumed that the starting ids are input_ids and not inputs_embeds.
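
A hedged sketch of the behaviour being tested here, using a stock GPT-2 checkpoint purely for illustration: with this PR, a decoder-only model is expected to reject inputs_embeds.

import pytest
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

input_ids = tokenizer("Hello world", return_tensors="pt").input_ids
inputs_embeds = model.get_input_embeddings()(input_ids)

# Decoder-only generation appends the predicted ids to the starting ids,
# so it needs real `input_ids`; passing only embeddings should raise.
with pytest.raises(ValueError):
    model.generate(inputs_embeds=inputs_embeds)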

Collaborator:

Why not use self.assertRaises(ValueError) here?

@sgugger (Collaborator) left a comment:

Thanks for adding support for this!

src/transformers/generation_utils.py (review comments, outdated/resolved)
tests/test_generation_utils.py (review comments, outdated/resolved)

@Narsil (Contributor) left a comment:

LGTM, some nits (+ agree with @sgugger's).

model = BartForConditionalGeneration.from_pretrained("hf-internal-testing/tiny-random-bart", max_length=5).to(
    torch_device
)
model.config.eos_token_id = None
input_ids = tokenizer(article, return_tensors="pt").input_ids.to(torch_device)
Contributor:


What's the length of this?
Without that information it's hard to tell whether self.assertEqual(output_sequences.shape, (1, 5)) is actually correct.
(I know it's an encoder-decoder, but it becomes important in the other, decoder-only test.)

Maybe as an additional guarantee we could force the decoder_input_ids (to prove to the reader that some tokens were generated)?

Contributor (Author):


Well, it comes pretty much only from max_length=5.

Contributor (Author):


added a comment

model = BartForConditionalGeneration.from_pretrained("hf-internal-testing/tiny-random-bart", max_length=5).to(
    torch_device
)
model.config.eos_token_id = None
Contributor:


Why is that necessary for this test? Is it so EOS is not produced before max_length?
Maybe add a small comment where this is present?

Contributor (Author):


Just to make sure it can't finish before hitting max_length.
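
Putting the two points together, a sketch of why the shape assertion discussed above holds (tokenizer, article, torch_device, and the assertion come from the surrounding test context, so treat the exact names as assumptions):

model = BartForConditionalGeneration.from_pretrained(
    "hf-internal-testing/tiny-random-bart", max_length=5
).to(torch_device)
# No EOS token means generation can never stop early, so it always runs to max_length.
model.config.eos_token_id = None

input_ids = tokenizer(article, return_tensors="pt").input_ids.to(torch_device)
inputs_embeds = model.get_input_embeddings()(input_ids)

output_sequences = model.generate(inputs_embeds=inputs_embeds)
# One sequence of exactly max_length=5 tokens.
self.assertEqual(output_sequences.shape, (1, 5))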

Successfully merging this pull request may close these issues.

T5 model seq2seq text generation using word embeddings instead of token_ids does not work