Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Which model should I use for machine translation? #1436

Closed
maurus56 opened this issue Oct 6, 2019 · 6 comments
Closed

Which model should I use for machine translation? #1436

maurus56 opened this issue Oct 6, 2019 · 6 comments
Labels

Comments

@maurus56
Copy link

maurus56 commented Oct 6, 2019

❓ Questions & Help

I’m interested in training a model for translating articles from Spanish to English. There is too little information (Tutorials) about MT, should I use BERT, XLM or any other one? Also could you explain how to train the proposed model feeding the data, and output the predicted translation.
And is there a way to use XLNet so when translating chapters of a book it can remember the context of the previous ones and better translate?

There is even a model by Microsoft (MASS) that looks simple to use, would you recommend it?

@rlouf rlouf added the usage label Oct 7, 2019
@thomwolf
Copy link
Member

thomwolf commented Oct 9, 2019

Hi I recommend using XLM from Facebook for MT currently: https://github.com/facebookresearch/XLM
We may add some models for MT in the mid-term though.

@Bachstelze
Copy link

MASS reports higher BLEU-scores than XLM which is good in pretraining an encoder, but lacks in the training description of the decoder. So we could try to extend the XML-R #1769 encoder with MASS.

@Bachstelze
Copy link

In which languages and domains are you interested?

@maurus56
Copy link
Author

maurus56 commented Jan 8, 2020

I'm looking for translation mainly of Spanish and Chinese to English, mainly books and articles so maintaining an overall consistency of the terms and words is crucial.

Just a single way translation is enough and also it should be possible to further train the model on the already translated works.

@LysandreJik LysandreJik removed the usage label Feb 7, 2020
@mahlettaye
Copy link

Please help me on how to use xmlr for summarization? Also if there is any example based on xmlr.

@stale
Copy link

stale bot commented May 12, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the wontfix label May 12, 2020
@stale stale bot closed this as completed May 19, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

6 participants