Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Model weights contributions? #1463

Open
deep-diver opened this issue Feb 23, 2024 · 5 comments
Open

Model weights contributions? #1463

deep-diver opened this issue Feb 23, 2024 · 5 comments
Labels

Comments

@deep-diver
Copy link

There is a model called SOLAR. This model follows the same architecture as LLaMA2, but it has more layers which make it outstanding performer better than Mistral and even Mixtral at some points (open LLM Leaderboard)

In this case, what could be the contribution points?

@innat
Copy link

innat commented Feb 23, 2024

Is it? https://arxiv.org/abs/2312.15166

@deep-diver
Copy link
Author

@innat yes, that is the one. The model arch is based on LLaMA2, but it is basically a sort of mixture of LLaMA2 and Mistral.

@mattdangerw
Copy link
Member

@tirthasheshpatel is working on finishing up our llama2 implementation. Once it is ready, we could probably just extend our conversion script an d add this as a variant for llama2?

@deep-diver
Copy link
Author

@mattdangerw thanks! Looks like the conversion script is already ready.

Also wondering if you guys accept contributions on the reverse conversion script (keras to HF)

@hunkim
Copy link

hunkim commented Mar 12, 2024

@deep-diver thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

5 participants