CPU-optimized NMT models for Firefox Translations.
The model files are hosted using Git LFS.
prod - higher quality models
dev - test models under development (can be of low quality or speed).
When a dev model has satisfactory quality, it is moved to prod.
Automatic evaluation is a part of pull request CI. It uses Microsoft and Google translation APIs and pushes results back to the branch (not available for forks). It is performed using firefox-translations-evaluation tool.
To see SacreBLEU datasets run
All models should be contributed to
dev folder first.
Create a pull Request to
main branch from another branch in this repo.
Create a Pull Request to
When it is reviewed and merged, another pull request to
main branch will be created by a maintainer to kick off automatic evaluation.
You can run model evaluation locally by running
Make sure to set environment variables
AZURE_TRANSLATOR_KEY to use Google and Microsoft APIs.
If you want to run it with
bergamot only, remove mentions of those variables from
bash scripts/update-results.sh and remove
Create a new release with a version tag
x.y.z following semantic versioning.
The models will be automatically uploaded to GCS bucket
Prefix of the vocabulary file in the model registry:
vocab.- vocabulary is reused for the source and target languages
trgvocab.- different vocabularies for the source and target languages
Suffix of the model file in the registry:
gemm-precision: int8shiftAllinference setting
gemm-precision: int8shiftAlphaAllinference setting
The prod/dev labels in this repo correspond to the labels in the legacy web extension and are not related to the native integration in Firefox.
- Spanish <-> English
- Estonian <-> English
- English <-> German
- Czech <-> English
- Bulgarian <-> English
- Norwegian Bokmål -> English
- Portuguese <-> English
- Italian <-> English
- Polish <-> English
- French <-> English
- Russian <-> English
- Persian (Farsi) <-> English
- Icelandic -> English
- Norwegian Nynorsk -> English
- Ukrainian <-> English
- Dutch <-> English
- Catalan -> English
- Hungarian -> English
- Finnish -> English