[ENHANCEMENT] Do you have a plan that supports Mixtral 8x7B? #649
Comments
I have the same question.
I am working on it, but I am not sure whether it will be accepted.
@matrixssy Any progress? Thank you.
See #667.
Hi, please refer to this script for MoE/Mixtral training.
Great! But I would like to know how to convert Hugging Face (HF) weights to Megatron (MG) format, and whether it is possible to convert them back after training.
Marking as stale. No activity in 60 days.
No description provided.