LASER3-type SONAR encoder distillation script #17

Open
lydianish opened this issue Feb 12, 2024 · 4 comments

Comments

@lydianish

lydianish commented Feb 12, 2024

Following the discussion in this closed issue, I would like to request a LASER3-type distillation script for the SONAR text encoder.

@avidale
Contributor

avidale commented Mar 11, 2024

Hi @lydianish!
In #8 (comment), I provided an example of distilling a speech encoder from the text one.
Here, you are looking for a way to distill a language-specific text encoder, right?
This is something that we could do as well.

@lydianish
Author

Hi @avidale, thanks! Let me have a look.
Yes, I am looking for a way to distill a language-specific text encoder. I think my main concern was that I couldn't find any training scripts in Fairseq2 like we had in Fairseq for distilling LASER, but I see here you used Transformers so that explains it!

@avidale
Contributor

avidale commented Mar 11, 2024

@lydianish In fact, I used Transformers only for a couple of helper functions, and wrote most of the training loop from scratch in plain Python and Torch. It was a deliberate decision not to force people into any specific framework.

In the future, when we develop a robust and easy-to-use training pipeline for Fairseq2, we will probably publish it. But currently, the Fairseq2 API is still changing rapidly and may undergo many backward-incompatible changes, so we don't want to popularize it yet.
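
For readers of this thread, here is a rough sketch of what a from-scratch teacher-student loop in plain Python and Torch could look like. It is not the script referenced above and does not use SONAR or Fairseq2: `ToyEncoder`, the vocabulary size, the embedding dimension, and the random "parallel" batch are all hypothetical stand-ins, and the MSE objective is an assumption (a cosine-based loss would be another option). The idea shown is only the general pattern: a frozen teacher embeds one side of a sentence pair, and the student is trained to produce the same vector for the other side.

```python
import torch
from torch import nn

torch.manual_seed(0)

EMB_DIM, VOCAB = 16, 50  # toy sizes, not SONAR's real dimensions


class ToyEncoder(nn.Module):
    """Hypothetical sentence encoder: mean-pooled token embeddings + linear layer."""

    def __init__(self):
        super().__init__()
        self.emb = nn.Embedding(VOCAB, EMB_DIM)
        self.proj = nn.Linear(EMB_DIM, EMB_DIM)

    def forward(self, token_ids):
        # (batch, seq_len) -> (batch, EMB_DIM)
        return self.proj(self.emb(token_ids).mean(dim=1))


# Stand-in for the frozen teacher (in practice, a pretrained text encoder).
teacher = ToyEncoder().eval()
for p in teacher.parameters():
    p.requires_grad_(False)

# Stand-in for the language-specific student being distilled.
student = ToyEncoder()
optimizer = torch.optim.Adam(student.parameters(), lr=1e-2)
loss_fn = nn.MSELoss()  # assumption: MSE between student and teacher embeddings

# Fake parallel batch: the teacher reads the source side, the student the target side.
src_ids = torch.randint(0, VOCAB, (32, 8))
tgt_ids = torch.randint(0, VOCAB, (32, 8))

losses = []
for step in range(200):
    with torch.no_grad():
        target_emb = teacher(src_ids)  # teacher embeddings are fixed targets
    loss = loss_fn(student(tgt_ids), target_emb)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    losses.append(loss.item())
```

In a real setup, the fake batch would be replaced by tokenized parallel sentences and the toy modules by the actual pretrained teacher and a student initialized however suits the target language.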

@lydianish
Author

@avidale Thanks for the clarification!
