Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Extend glossary with forbidden terms #846

Closed
danielnaber opened this issue Sep 12, 2015 · 6 comments
Closed

Extend glossary with forbidden terms #846

danielnaber opened this issue Sep 12, 2015 · 6 comments
Assignees
Labels
enhancement Adding or requesting a new feature.
Projects
Milestone

Comments

@danielnaber
Copy link

danielnaber commented Sep 12, 2015

Example: One might add folder and its translation to the glossary. Often, this implies that folder should be used, and not a synonym like directory. These forbidden terms could be specified as a regular expression (e.g. director(y|ies)) and there could be a check which makes sure the forbidden words are not used.

@nijel nijel added the enhancement Adding or requesting a new feature. label Sep 13, 2015
@nijel nijel added this to TODO in Glossary Feb 27, 2017
@fastestnoob
Copy link
Contributor

... or glossary just gives a message which says "you used another word for existing glossary term. are you sure?"

@comradekingu
Copy link
Contributor

http://www.korrekturavdelingen.no/vanlige-skrivefeil_a.htm has a list of the words that most often end up being wrong, and the corresponding mistakes that pop up. It would be nice to set up a relational search for those.

Our Norwegian Bokmål glossary has entries accompanied by illegal and shunned variants http://i18n.skulelinux.no/nb/Fellesordl.eng-no.html

If you see a strike-through word in the editor, it should serve as a reminder that the word is not to be used :)

@nijel
Copy link
Member

nijel commented Sep 18, 2020

I'm not sure using regexp is a good idea. The regexp experience so far is that it's a powerful feature, which many users are afraid and thus won't use.

Scope of this feature:

  • Add flag for glossary terms that this is a prohibited term
  • Add check looking at translation whether it contains one of these

Possible future extension:

  • Once we have rich editor, the forbidden terms could be highlighted there

@comradekingu
Copy link
Contributor

comradekingu commented Sep 18, 2020

I meant as an automated check. Write a word that doesn't exist, and it pops up asking you to fix it.
Not everyone has a spellchecker it seems.

It could be made impossible to save the string, and it would carry a flag if this check is avoided maybe?

The following is how I see dictionaries:

If the dictionary really is a database of single word strings and the type of word they are, it can be used to auto-translate and ensure consistency, with actually good results. Also it would make sense to work on the dictionary then, because it would be useful for the whole instance, and highlight missing words.
It also takes out most of the grind right away, as most strings are by law of entropy both short and repeated across a lot of components.

Call it the master source translation. Weblate already has a "used elsewhere" for translations.
There just needs to be a way of altering the "key" of duplicated words that are in a different tense or similar.(?)
Right now I have a basic idea of what words are used in what projects, but it doesn't scale, and the info does not populate well between translators. So either you have some consistency with one major translator (and consequently as consistent errors, or you have scaling towards but ultimately away from consistency.

Consequently, the way to start it is to take the entire translation memory of single word strings, sort it by frequency, and start weeding out the errors.

The problem with the regexp is there is no overview of what the available terms are. #4541

Don't mind me obviously, I am just talking out of my head, but then again it makes all the sense in my head.

@nijel
Copy link
Member

nijel commented Sep 18, 2020

@comradekingu Spell checker is option as well (see #443 or #977), but this issue is about prohibiting certain words. Typical example could be folder / dictionary mentioned in the initial post.

@nijel nijel added this to the 4.5 milestone Dec 4, 2020
nijel added a commit that referenced this issue Feb 4, 2021
@nijel nijel self-assigned this Feb 4, 2021
@nijel nijel moved this from TODO to In progress in Glossary Feb 4, 2021
@nijel nijel closed this as completed in fd6075a Feb 4, 2021
Glossary automation moved this from In progress to Done Feb 4, 2021
@github-actions
Copy link

github-actions bot commented Feb 4, 2021

Thank you for your report, the issue you have reported has just been fixed.

  • In case you see a problem with the fix, please comment on this issue.
  • In case you see a similar problem, please open a separate issue.
  • If you are happy with the outcome, don’t hesitate to support Weblate by making a donation.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement Adding or requesting a new feature.
Projects
Glossary
  
Done
Development

No branches or pull requests

4 participants