Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Language Code Difference #75

Closed
hdeval1 opened this issue May 11, 2022 · 1 comment
Closed

Language Code Difference #75

hdeval1 opened this issue May 11, 2022 · 1 comment

Comments

@hdeval1
Copy link

hdeval1 commented May 11, 2022

After getting the following tatoeba recipes to successfully run:

  • tatoeba-prepare
  • tatoeba-train
  • tatoeba-eval
    I have been working on utilizing the fine tuning capabilities on the models built. In this process I noticed that for the tatoeba recipes, SRCLANGS/TRGLANGS is set to a 3 letter language code (ISO-693-3) and all other recipes (including the finetuning recipes) use a two letter language code (ISO 693-1). It is quite confusing to have to switch back and forth, and definitely causes confusion for the Makefiles when they are searching for data files. Is there anyway to make all the recipes use all the same language code for consistency?
@hdeval1
Copy link
Author

hdeval1 commented May 16, 2022

Nevermind! I found the LanguageCodes tool in the tools folder and used that. Thanks!

@hdeval1 hdeval1 closed this as completed May 16, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant