Which model should I use for machine translation? #1436

maurus56 · 2019-10-06T23:37:04Z

❓ Questions & Help

I’m interested in training a model for translating articles from Spanish to English. There is very little information (tutorials) about MT. Should I use BERT, XLM, or another model? Also, could you explain how to train the proposed model, feed it the data, and output the predicted translation?
Also, is there a way to use XLNet so that, when translating the chapters of a book, it remembers the context of the previous chapters and translates better?

There is also a model by Microsoft (MASS) that looks simple to use. Would you recommend it?

thomwolf · 2019-10-09T01:08:02Z

Hi, I currently recommend using XLM from Facebook for MT: https://github.com/facebookresearch/XLM
We may add some models for MT in the mid-term though.

Bachstelze · 2019-11-17T09:38:28Z

MASS reports higher BLEU scores than XLM. XLM is good at pretraining an encoder, but its description of how the decoder is trained is lacking. So we could try to extend the XLM-R (#1769) encoder with MASS.

Bachstelze · 2019-12-18T13:03:19Z

In which languages and domains are you interested?

maurus56 · 2020-01-08T10:26:23Z

I'm mainly interested in translating Spanish and Chinese to English, mostly books and articles, so maintaining overall consistency of terms and wording is crucial.

One-way translation is enough, and it should also be possible to further train the model on works that have already been translated.

mahlettaye · 2020-03-13T08:25:49Z

Could you please help me with how to use XLM-R for summarization? Also, is there any example based on XLM-R?

stale · 2020-05-12T09:41:05Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

rlouf added the usage label Oct 7, 2019

LysandreJik removed the usage label Feb 7, 2020

stale bot added the wontfix label May 12, 2020

stale bot closed this as completed May 19, 2020
