Missing translations that are not assigned to a sense #20

Closed
karlb opened this issue Mar 3, 2024 · 0 comments

Currently, the assumption is that for each Wiktionary, translations are either:

  • assigned to a sense or
  • assigned to the lexical entry, but with a gloss that describes the "sense" for that specific translation

Unfortunately, this does not cover all translations. In the Spanish Wiktionary, translations are in a single section for the lexentry (so not directly assigned to a sense), but often contain a numeric reference (e.g. "[2]") to identify the sense. dbnary is smart enough to parse these and assign the translation to the sense in that case.

However, not all translations have these numeric sense references and therefore stay linked to the lexentry. These translations are currently lost to WikDict. It would make sense to include these translations with an empty sense/gloss.
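
The proposed fallback could be sketched roughly as follows. This is only an illustration, not WikDict's actual code: the `Translation` dataclass, its field names and the `gloss_for` function are hypothetical, and the real pipeline works on the dbnary RDF data rather than on Python objects like these.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Translation:
    # Hypothetical record; field names are illustrative, not WikDict's schema.
    written_form: str
    sense_gloss: Optional[str] = None  # gloss of the sense, if assigned to one
    own_gloss: Optional[str] = None    # gloss attached to the translation itself


def gloss_for(tr: Translation) -> str:
    """Return the gloss to store alongside a translation.

    Falls back to an empty string instead of dropping the translation
    when neither a sense nor a translation-level gloss is available.
    """
    if tr.sense_gloss is not None:
        return tr.sense_gloss
    if tr.own_gloss is not None:
        return tr.own_gloss
    return ""
```

With this rule, a translation that is only linked to the lexentry is kept with an empty gloss rather than being discarded.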

Example:
https://es.wiktionary.org/wiki/monje

zcat ttl/es_dbnary_*.ttl.gz | awk 'BEGIN {RS=""} /_tr_.*monje/ {print "\n"$0}'

spa:__tr_deu_1_monje__sustantivo_masculino__1
        rdf:type                dbnary:Translation;
        dbnary:isTranslationOf  spa:monje__sustantivo_masculino__1;
        dbnary:targetLanguage   lexvo:deu;
        dbnary:writtenForm      "Mönch"@de .

spa:__tr_bre_1_monje__sustantivo_masculino__1
        rdf:type                dbnary:Translation;
        dbnary:isTranslationOf  spa:monje__sustantivo_masculino__1;
        dbnary:targetLanguage   lexvo:bre;
        dbnary:writtenForm      "manac'h"@br .

spa:__tr_eng_1_monje__sustantivo_masculino__1
        rdf:type                dbnary:Translation;
        dbnary:isTranslationOf  spa:monje__sustantivo_masculino__1;
        dbnary:targetLanguage   lexvo:eng;
        dbnary:writtenForm      "monk"@en .
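
For anyone who prefers to inspect these records from Python, here is a minimal sketch that roughly mirrors the awk filter above and turns each record into a row with an empty gloss. The function names and the dictionary keys are made up for this example, and the regexes only handle the simple layout shown in this issue; a real implementation would more likely use an RDF parser.

```python
import re


def translation_blocks(ttl_text: str, word: str) -> list[str]:
    """Split text into blank-line-separated records (similar to awk's RS="")
    and keep the records matching `_tr_.*<word>`."""
    # DOTALL lets `.` span line breaks inside a single record.
    pattern = re.compile(r"_tr_.*" + re.escape(word), re.DOTALL)
    blocks = re.split(r"\n{2,}", ttl_text.strip("\n"))
    return [b for b in blocks if pattern.search(b)]


def parse_translation(block: str) -> dict[str, str]:
    """Extract the properties shown in the example from one record."""

    def grab(regex: str) -> str:
        match = re.search(regex, block)
        return match.group(1) if match else ""

    return {
        "source": grab(r"dbnary:isTranslationOf\s+([^\s;]+)"),
        "target_language": grab(r"dbnary:targetLanguage\s+([^\s;]+)"),
        "written_form": grab(r'dbnary:writtenForm\s+"([^"]*)"'),
        # Assumption: no sense or gloss is known, so store an empty gloss.
        "gloss": "",
    }


# Two of the records from the example above, used as sample input.
sample = """spa:__tr_deu_1_monje__sustantivo_masculino__1
        rdf:type                dbnary:Translation;
        dbnary:isTranslationOf  spa:monje__sustantivo_masculino__1;
        dbnary:targetLanguage   lexvo:deu;
        dbnary:writtenForm      "Mönch"@de .

spa:__tr_eng_1_monje__sustantivo_masculino__1
        rdf:type                dbnary:Translation;
        dbnary:isTranslationOf  spa:monje__sustantivo_masculino__1;
        dbnary:targetLanguage   lexvo:eng;
        dbnary:writtenForm      "monk"@en .
"""

rows = [parse_translation(b) for b in translation_blocks(sample, "monje")]
for row in rows:
    print(row)
```

Each row keeps the lexentry in `source`, which is where these translations stay linked when no numeric sense reference is present.
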

karlb closed this as completed in c91779b Mar 12, 2024