Punjabi Gurmukhi title marked as German #11

bgo-eiu · 2022-12-25T15:49:28Z

Article addrd at: https://www.wikidata.org/wiki/Q115863887

The code automatically selected for the monolingual text title was German. Granted, it is possible this is a problem with the way the source data is marked up

rdmpage · 2022-12-26T20:46:31Z

@bgo-elu Language detection is not handled well, I need to do some work on this to improve its accuracy.

bgo-eiu · 2023-02-08T21:11:59Z

@rdmpage Would it be possible for the tool to have a few hard-coded language-to-journal matches for where accurate detection is unlikely? For example, I am interested to create items for quite a few articles published by this Brahui language journal like https://doi.org/10.54781/abz.v7i1.155

At the moment, the title gets detected as Arabic, and the label gets placed in "en" rather than "brh" since brh is not in the list - at least for cases like this, where the number of Brahui research journals is likely quite small to begin with, it might be simpler just to have this journal's DOI prefix associated with the language code. I suppose if you wanted to implement this systematically, it could be done with a query for existing items with both DOIs and a value for P407 "language of work or name" that is not among the most frequently published languages.

rdmpage · 2023-02-10T09:25:04Z

@bgo-eiu Interesting idea, I'll need to check what languages my code can detect. The idea of being able to set the default language makes sense.

rdmpage added bug Something isn't working language Issues related to language detection labels Dec 26, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Punjabi Gurmukhi title marked as German #11

Punjabi Gurmukhi title marked as German #11

bgo-eiu commented Dec 25, 2022

rdmpage commented Dec 26, 2022

bgo-eiu commented Feb 8, 2023

rdmpage commented Feb 10, 2023

Punjabi Gurmukhi title marked as German #11

Punjabi Gurmukhi title marked as German #11

Comments

bgo-eiu commented Dec 25, 2022

rdmpage commented Dec 26, 2022

bgo-eiu commented Feb 8, 2023

rdmpage commented Feb 10, 2023