Fix output shape when resuming generation #211

borzunov · 2023-01-13T12:07:14Z

Before this PR, model.generate() returned one excess token when resuming generation with an existing (the last token of the previous session, session.last_token_id). This is an unexpected behavior not convenient for the downstream apps, so this PR changes it until it's too late.

Fix outputs when resuming generation

7305a65

borzunov force-pushed the fix-resuming-generation branch from 91b721e to 7305a65 Compare January 13, 2023 12:07

borzunov changed the title ~~Fix outputs when resuming generation~~ Fix output format when resuming generation Jan 13, 2023

borzunov changed the title ~~Fix output format when resuming generation~~ Fix output shape when resuming generation Jan 13, 2023

Update packaging version to resolve warning on Colab

c1f1088

borzunov merged commit 6ba63c6 into main Jan 13, 2023

borzunov deleted the fix-resuming-generation branch January 13, 2023 12:27

borzunov added a commit to petals-infra/chat.petals.dev that referenced this pull request Jan 13, 2023

Update code to match with bigscience-workshop/petals#211

b33c58d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix output shape when resuming generation #211

Fix output shape when resuming generation #211

borzunov commented Jan 13, 2023 •

edited

Loading

Fix output shape when resuming generation #211

Fix output shape when resuming generation #211

Conversation

borzunov commented Jan 13, 2023 • edited Loading

borzunov commented Jan 13, 2023 •

edited

Loading