OSError after a few translations #8

jonas-nothnagel · 2021-02-02T06:45:52Z

Hi and thanks for the cool library!

I want to use the translation function in one of my data pipelines, which loops over thousands of text snippets. I'm on Windows without GPU support; following the instructions in the other issue, I got the function working.

from easynmt import EasyNMT
model = EasyNMT('opus-mt')

and I translate with:

language = detect_langs(text)
for each_lang in language:
   if (each_lang.lang != "en"):
      translated_text = model.translate(text, target_lang='en')

where text is a string.
However, after a few translations (2-3) I always run into this error:

OSError: Can't load tokenizer for 'Helsinki-NLP/opus-mt-ia-en'. Make sure that:
- 'Helsinki-NLP/opus-mt-ia-en' is a correct model identifier listed on 'https://huggingface.co/models'

Any idea what the problem could be?


nreimers · 2021-02-02T07:21:41Z

One of your sentences was detected as language ia (Interlingua), but there is no translation model for ia -> en.

jonas-nothnagel · 2021-02-02T08:11:24Z

My mistake, sorry!
May I ask a follow-up question? If I change the code to:

language = detect_langs(text)
for each_lang in language:
   if (each_lang.lang == "es" or each_lang.lang == "fr"):
      translated_text = model.translate(text, target_lang='en')

I still run into translation errors, such as no model being available for Portuguese to English (pt -> en). Does model.translate() detect the language again on its own? In those cases its result seems to contradict what detect_langs() returned.

nreimers · 2021-02-02T08:13:18Z

Yes, if you don't specify the source_lang, it detects the language again automatically.

You can fix it like this:

translated_text = model.translate(text, source_lang=your_detected_source_lang, target_lang='en')
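
For the pipeline case described above, the same idea might be wrapped in a small helper that detects the language once, skips anything outside an allow-list, and forwards the detected code as source_lang. This is only a sketch: translate_if_supported and its parameters are hypothetical names, and the commented wiring assumes detect comes from the langdetect package (the thread does not say where detect_langs was imported from).

```python
from typing import Callable, Iterable, Optional


def translate_if_supported(
    text: str,
    detect: Callable[[str], str],
    translate: Callable[..., str],
    allowed_langs: Iterable[str] = ("es", "fr"),
    target_lang: str = "en",
) -> Optional[str]:
    """Detect the language once and reuse it as source_lang.

    Returns None when the detected language is not in allowed_langs.
    """
    source_lang = detect(text)
    if source_lang not in set(allowed_langs):
        return None  # skip text we do not want to (or cannot) translate
    # Passing source_lang explicitly means translate() has no need to
    # run its own language detection, so the two cannot disagree.
    return translate(text, source_lang=source_lang, target_lang=target_lang)


# Hypothetical wiring with the libraries from this thread (not executed here;
# assumes langdetect provides detect):
#   from langdetect import detect
#   from easynmt import EasyNMT
#   model = EasyNMT('opus-mt')
#   translated_text = translate_if_supported(text, detect, model.translate)
```

Taking detect and translate as arguments keeps the helper independent of any particular detector, so the allow-list check is applied to the same language code that is handed to model.translate().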

jonas-nothnagel closed this as completed Feb 2, 2021
