We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
We would like to create a script for creating a merged model by using the C-BTM method.
The script would take as input:
List of experts models from the [MDEL HF repo](https://huggingface.co/Multi-Domain-Expert-Layers). Name of the output model
The averaged model would be uploaded to the MDEL HF repo. It's model card should contain the names of the experts it was created from.
The text was updated successfully, but these errors were encountered:
I would also work on this too. @NourFahmy There are two steps that we could split 😃
Sorry, something went wrong.
@NourFahmy @kenhktsui Check out Minho's adapation of the clustering step from the cBTM repo.
Hi @kenhktsui - happy to take on inference and support where need be on clustering, and to fill any gaps from Minho's efforts.
I've put up a PR here
I've made the following assumptions I can easily fix:
kindly inform if anything else is needed!
cc: @mrcabbage972
kenhktsui
NourFahmy
No branches or pull requests
We would like to create a script for creating a merged model by using the C-BTM method.
The script would take as input:
The averaged model would be uploaded to the MDEL HF repo. It's model card should contain the names of the experts it was created from.
The text was updated successfully, but these errors were encountered: