salad salad salad salad salad salad salad salad #46

fdelapena · 2021-02-19T00:46:36Z

salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad sala salad salad salad salad salad sala salad salad salad salad salad salad salad salad salad sala salad salad salad salad salad salad salad salad salad salad salad salad sala salad salad salad sala salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad salad sala salad salad salad salad salad salad salad salad salad salad salad sala sala sala sala

randallmoraes · 2021-02-19T00:56:40Z

salad ?

PJ-Finlay · 2021-02-19T02:42:32Z

Looks like it likes salad.

This is an Argos Translate issue. I reproduced it and the sentence boundary detection and tokenization look fine. Argos Translate uses a Transformer as its sequence to sequence model. The model is a black box that can sometimes have weird outputs. If you post on the OpenNMT forum you might get a better answer. The PyTorch port for the training scripts is almost done which will have a larger model and more training resources but I'm not sure when an updated Spanish model will get trained. The new model will likely fix this specific issue and hopefully have fewer similar ones.

fdelapena · 2021-02-19T02:52:23Z

Thanks, I'll try posting about this there.
As a remark, it seems the text output in the Argos Translate you shown, it looks slightly different. Note the "sala sala sala sala sala" (without d) is not the same count and positioning. I guess the training data or iteration count were not the same.

Update: I've found the following post, not sure if related, with some proposals: https://forum.opennmt.net/t/repeated-phrases-in-the-translation/4155

PJ-Finlay · 2021-02-19T03:41:34Z

In general I don't think Argos Translate has deterministic translations the model itself was only trained once but for some reason sometimes comes up with different results. Based on the CTranslate Python docs it doesn't look like CTranslate supports the lock_ngram_repeat param they're talking about in the linked forum post.

guillaumekln · 2021-02-19T08:22:46Z

The training data mostly contains full sentences. So the model is good at translating sentences. But here the input is a single word which is a different task. If you want the model to perform well on these inputs, you should add such examples in the training data.

(I'm the author of CTranslate2. Feel free to tag me if you have any questions or issues. We are here to help.)

pierotofy · 2021-02-19T13:23:43Z

lol, I had a giggle at this :)

Hey @guillaumekln ✋ glad to have you here! CTranslate2 is pretty amazing.

- argosopentech/argos-translate#17 - LibreTranslate/LibreTranslate#46

PJ-Finlay · 2021-02-20T14:59:07Z

I added a Wiktionary scraping script so hopefully future models will do this better.

bruno-kakele · 2023-12-08T22:52:32Z

Hi @PJ-Finlay , sorry to tag you here, but for some reason my post was flagged as spam in the community forums: https://community.libretranslate.com/t/odd-translation-behavior-repeating-words/827

If I understand correctly, I need to release a more recent model for a language that includes the wiktionary data? How do I know if a language uses the Wiktextract data? (Based on this: Argos Open Tech , I cannot tell). The data-index.json seems to be outdated (can't find some languages there).

Thanks in advance

pierotofy added bug Something isn't working help wanted labels Feb 19, 2021

PJ-Finlay added a commit to argosopentech/argos-train that referenced this issue Feb 20, 2021

Added Wiktionary scraping

7cc353e

- argosopentech/argos-translate#17 - LibreTranslate/LibreTranslate#46

PJ-Finlay mentioned this issue Mar 10, 2021

Translating a text with arteristics to its sides #58

Closed

PJ-Finlay mentioned this issue Mar 21, 2021

Weird english -> japanese translations (bad training data?) argosopentech/argos-translate#58

Closed

pierotofy mentioned this issue Mar 28, 2021

bug in translating "translate" from english to arabic #66

Closed

pierotofy mentioned this issue Feb 18, 2022

Turkish totally broken #213

Closed

dingedi closed this as completed May 19, 2022

tassoman mentioned this issue Aug 4, 2022

can't build using docker-compose #292

Closed

pierotofy mentioned this issue Sep 28, 2022

Error in translating "Nyheter" from Swedish #322

Closed

pierotofy mentioned this issue Oct 16, 2022

Possible bug - Error translating the word 'from' from English to Chinese #331

Closed

PJ-Finlay mentioned this issue Feb 11, 2023

Translations with repetitions argosopentech/argos-translate#318

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

salad salad salad salad salad salad salad salad #46

salad salad salad salad salad salad salad salad #46

fdelapena commented Feb 19, 2021

randallmoraes commented Feb 19, 2021

PJ-Finlay commented Feb 19, 2021

fdelapena commented Feb 19, 2021 •

edited

PJ-Finlay commented Feb 19, 2021

guillaumekln commented Feb 19, 2021

pierotofy commented Feb 19, 2021

PJ-Finlay commented Feb 20, 2021

bruno-kakele commented Dec 8, 2023

salad salad salad salad salad salad salad salad #46

salad salad salad salad salad salad salad salad #46

Comments

fdelapena commented Feb 19, 2021

randallmoraes commented Feb 19, 2021

PJ-Finlay commented Feb 19, 2021

fdelapena commented Feb 19, 2021 • edited

PJ-Finlay commented Feb 19, 2021

guillaumekln commented Feb 19, 2021

pierotofy commented Feb 19, 2021

PJ-Finlay commented Feb 20, 2021

bruno-kakele commented Dec 8, 2023

fdelapena commented Feb 19, 2021 •

edited