Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for GPT-JT model + make it easier to plug in other LMs #31

Closed
anas-awadalla opened this issue Dec 16, 2022 · 1 comment · Fixed by #37
Closed

Add support for GPT-JT model + make it easier to plug in other LMs #31

anas-awadalla opened this issue Dec 16, 2022 · 1 comment · Fixed by #37
Assignees
Labels
enhancement New feature or request

Comments

@anas-awadalla
Copy link
Collaborator

GPT-JT seems like a great choice for the LM side as it outperforms many other open-source models and is considerably smaller. Currently, we only support OPT models, but in general, it would be great to add more options.

The current design of the Flamingo class makes it difficult to plug in any huggingface model, as you would need to write a class specifically for each model family. We should explore more modular ways to build this.

@i-gao
Copy link
Collaborator

i-gao commented Feb 11, 2023

#66

@i-gao i-gao closed this as completed Feb 14, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants