This repository has been archived by the owner on Oct 9, 2023. It is now read-only.
Train general translation task #50
Labels: enhancement (new feature or request), help wanted (extra attention is needed), won't fix (this will not be worked on)
🚀 Feature
Currently we use `mbart-large-en-ro` as the default backbone, defined here: https://github.com/PyTorchLightning/lightning-flash/blob/master/flash/text/seq2seq/translation/model.py#L39

We should move towards `mbart-large` and pre-train from that backbone. This would be cleaner and applicable to a much larger number of languages.

The other consideration is that this model is around 600M parameters (~2 GB), which is a reasonable size for the translation task. We may want to offer smaller variants for CI and quick iterations!
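For reference, the "~600M parameters" figure can be checked with a back-of-envelope count. This is a sketch assuming the published mBART-large dimensions (12 encoder + 12 decoder layers, `d_model=1024`, FFN dim 4096, ~250k-token vocabulary with embeddings shared across encoder, decoder, and LM head) and ignoring biases, layer norms, and positional embeddings, which contribute comparatively little:

```python
# Rough parameter count for an mBART-large-shaped model (assumed dims,
# ignoring biases / layer norms / positional embeddings).
d_model = 1024
ffn_dim = 4096
vocab = 250_027  # assumed mBART-25 vocabulary size

# Embedding table, shared between encoder, decoder, and output head.
embeddings = vocab * d_model

def attn_params(d):
    # q, k, v, and output projection matrices
    return 4 * d * d

def ffn_params(d, f):
    # up- and down-projection matrices
    return 2 * d * f

enc_layer = attn_params(d_model) + ffn_params(d_model, ffn_dim)
# Decoder layers carry an extra cross-attention block.
dec_layer = 2 * attn_params(d_model) + ffn_params(d_model, ffn_dim)

total = embeddings + 12 * enc_layer + 12 * dec_layer
print(f"~{total / 1e6:.0f}M parameters, ~{total * 4 / 1e9:.1f} GB in fp32")
# → ~608M parameters, ~2.4 GB in fp32
```

This lands right around the 600M/2 GB figure quoted above (the ~2 GB on disk matches fp16 or rounded fp32 storage), which is why a distilled or truncated variant would make CI runs much cheaper.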