Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

No text is shown when using MII in fp32 and greedy search #102

Closed
marshmellow77 opened this issue Nov 17, 2022 · 2 comments
Closed

No text is shown when using MII in fp32 and greedy search #102

marshmellow77 opened this issue Nov 17, 2022 · 2 comments

Comments

@marshmellow77
Copy link

When using greedy search (do_sample=False) and dtype=fp32 the generated tokens are not shown in the output of the query. I believe the text generation is happening, because different values for max_new_tokens lead to different runtimes for the query. See this notebook as a minimal example.

@mrwyattii
Copy link
Contributor

We don't currently support fp32 for the Bloom models in MII & DeepSpeed-Inference. I believe this is because the checkpoints are all in half precision. We correctly check the configs with Bloom-176B model, but fail to do so for the smaller variants. I added a fix for this in #107

I just ran your example using fp16 and I see output.

@marshmellow77
Copy link
Author

Thansk for adding a warning.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants