Fix a Bug, trainer_seq2seq.py, in the else branch at Line 172, generation_inputs should be a dict #14546

TranSirius · 2021-11-27T07:18:53Z

Fixing Bug

Fixes # (issue)

In trainer_seq2seq.py / Seq2SeqTrainer / prediction_step, Line 174 reads:

generated_tokens = self.model.generate(
    **generation_inputs,
    **gen_kwargs,
)

which requires generation_inputs to be a dict, because it is unpacked with **. However, in the else branch at Line 171, generation_inputs is created as a Tensor object, which cannot be unpacked that way and makes the call fail.

The fix creates generation_inputs as a dict with a single key, input_ids, that holds the tensor.
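The failure and the fix can be reproduced without loading a model. The sketch below is only an illustration under stated assumptions: fake_generate is a hypothetical stand-in for self.model.generate, plain nested lists stand in for tensors, and the inputs and gen_kwargs values are made up for the example.

```python
# Hypothetical stand-in for self.model.generate; NOT the real transformers API.
def fake_generate(input_ids=None, attention_mask=None, **gen_kwargs):
    # Echo back what was received so the call can be inspected.
    return {
        "input_ids": input_ids,
        "attention_mask": attention_mask,
        "gen_kwargs": gen_kwargs,
    }


# Assumed example data; nested lists stand in for torch tensors.
inputs = {"input_ids": [[101, 2023, 102]]}
gen_kwargs = {"max_length": 8, "num_beams": 1}

# Before the fix: generation_inputs is a bare value, not a mapping,
# so unpacking it with ** raises a TypeError.
generation_inputs = inputs["input_ids"]
try:
    fake_generate(**generation_inputs, **gen_kwargs)
    unpack_error = None
except TypeError as err:
    unpack_error = err

# After the fix: wrap the value in a dict keyed by "input_ids",
# so ** passes it as a keyword argument.
generation_inputs = {"input_ids": inputs["input_ids"]}
result = fake_generate(**generation_inputs, **gen_kwargs)

print(type(unpack_error).__name__)
print(result["input_ids"])
```

Wrapping the value in a dict keeps the call site unchanged, so the same generate call works for both branches.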


patrickvonplaten · 2021-12-07T13:47:23Z

Hey @TranSirius,

Thanks a lot for your PR here! It looks good to me - @sgugger can you maybe take a look as well?

patrickvonplaten · 2021-12-07T13:47:35Z

Should we maybe write some tests for this use case as well?

sgugger

Thanks for fixing!

sgugger · 2021-12-07T17:10:06Z

Oops didn't see your comment @patrickvonplaten. Adding a test would be nice to have indeed @TranSirius if you want to work on it on a separate PR.


TranSirius added 2 commits November 27, 2021 15:16

fix bug, trainer_seq2seq.py, Line 172, generation_inputs must be a dict before feeding into self.model.generation()

6eb9a8d

fix bug, trainer_seq2seq.py, Line 172, generation_inputs must be a dict before feeding into self.model.generation()

f67bda7

LysandreJik requested review from patil-suraj and patrickvonplaten December 3, 2021 20:17

patrickvonplaten removed the request for review from patil-suraj December 7, 2021 13:47

sgugger approved these changes Dec 7, 2021


sgugger merged commit 39f1dff into huggingface:master Dec 7, 2021
