This repository has been archived by the owner on Oct 9, 2023. It is now read-only.

Train general translation task #50

Closed
SeanNaren opened this issue Feb 2, 2021 · 1 comment
Labels
enhancement (New feature or request), help wanted (Extra attention is needed), won't fix (This will not be worked on)

Comments

@SeanNaren
Contributor

🚀 Feature

Currently we use mbart-large-en-ro as defined here: https://github.com/PyTorchLightning/lightning-flash/blob/master/flash/text/seq2seq/translation/model.py#L39

We should move towards mbart-large and pre-train from this backbone. This would be cleaner and more applicable to a large number of languages.

The other consideration is that this model has around 600M parameters (roughly 2 GB), which is a reasonable size for the translation task. We may want to consider smaller variants for CI and quick iterations!
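For reference, the size figure follows from simple arithmetic on the parameter count. Below is a minimal sketch, assuming the weights are stored as 32-bit floats and counting weights only (no optimizer state or activations). The helper name `checkpoint_size_gb` and the dtype table are hypothetical, introduced just for illustration; they are not part of lightning-flash.

```python
# Bytes needed to store one parameter for a few common dtypes.
# (Illustrative table; the issue does not say which dtype the checkpoint uses.)
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "int8": 1}


def checkpoint_size_gb(num_params: int, dtype: str = "float32") -> float:
    """Approximate weight storage in gigabytes (10**9 bytes), weights only."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    # ~600M parameters, the figure quoted in this issue.
    for dtype in BYTES_PER_PARAM:
        size = checkpoint_size_gb(600_000_000, dtype)
        print(f"{dtype}: {size:.1f} GB")
```

Under these assumptions, 600M float32 parameters come to about 2.4 GB, consistent with the rough 2 GB figure above. A variant with a fraction of the parameters shrinks proportionally, which is the motivation for a smaller model in CI.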

SeanNaren added the enhancement and help wanted labels on Feb 2, 2021
@stale

stale bot commented Apr 3, 2021

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale bot added the won't fix label on Apr 3, 2021
stale bot closed this as completed on Apr 15, 2021