Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add a python example on how to upload and use OPT #56

Closed
JoaoLages opened this issue May 6, 2022 · 4 comments
Closed

Add a python example on how to upload and use OPT #56

JoaoLages opened this issue May 6, 2022 · 4 comments
Labels
question Further information is requested

Comments

@JoaoLages
Copy link

First of all, thank you for open-sourcing OPT! 💪🚀

I'd like to ask for a simple example of how to use the new OPT model.
In here there could be a python code description on how to upload the model or, because probably the code is too big, point to a script with that example/tutorial.

@JoaoLages JoaoLages added the question Further information is requested label May 6, 2022
@spencer-hong
Copy link

Hi @JoaoLages, I'm not associated with Facebook, but based on looking through the repository, Facebook provides a script to consolidate your shards here. https://github.com/facebookresearch/metaseq/blob/main/metaseq/distributed/stitch_fsdp_ckpt.py

@nickums
Copy link

nickums commented May 10, 2022

Eager to use this amazing product, after following the setup instructions, I am looking for a way to start using it.
First, having selected a pre-trained model to use (reshard-model_part-0.pt etc),
what directory do i store them in, or, what load procedure should I use?
Second, what script to use for a simple 'prompt and response' interaction?
presumably using megatron/text-generation-server.py

@suchenzang
Copy link
Contributor

Consolidating all asks for model loading into #88, which will be released soon from HuggingFace.

We also have #78 and #77 open to fix some of these usability issues on our end.

@nickums
Copy link

nickums commented May 12, 2022 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

4 participants