nobody trained the model for eng-san (sanskrit)? #36

rcappuccio · 2024-04-04T14:27:04Z

Hi, I noticed the presence of a huge corpus for english-sanskrit (2.6 millions sentences), but the corresponding model performed very poorly (BLEU 2.3).
Why is that?

jorgtied · 2024-04-04T18:50:07Z

There is probably something seriously wrong with the training data. Could you have a look?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

nobody trained the model for eng-san (sanskrit)? #36

nobody trained the model for eng-san (sanskrit)? #36

rcappuccio commented Apr 4, 2024 •

edited

jorgtied commented Apr 4, 2024

nobody trained the model for eng-san (sanskrit)? #36

nobody trained the model for eng-san (sanskrit)? #36

Comments

rcappuccio commented Apr 4, 2024 • edited

jorgtied commented Apr 4, 2024

rcappuccio commented Apr 4, 2024 •

edited