-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support homographs (was: Re-consider need to abort due to duplicated preferred terms) #1065
Comments
@ronaldtse i believe ISO 10241-1 clause 6.2.6 Homographs and antonyms talks about this issue. it suggests when these type of homographs cannot be avoided, homographs shall be defined in separate terminological entries. a cross reference in a note to entry between these entries can be useful. however that adds complexity dealing with the note to keep the xref or not. not sure if this is what you were looking for or not. i am not familiar with such duplicated multilingual terms. this does not happen, in my understanding in asian languages as the pictograms would be different, but i dont have much experience in this area. |
Thank you @ReesePlews for pointing that out! I think the answer is definitive now, in 6.2.6 they provided an example where the French preferred term is duplicated (as in homographs representing two different concepts). There is a special treatment for the "Note to entry", where the language that contains a homograph will need a particular "Note to entry" for cross-referencing, and the language where the homograph does not exist will need a "Note to entry" as a placeholder. Recognition that "homographs" exist: Homographs in different languages are accepted (even in the same domain): |
OK, the abort becomes a warning. We do need a notion of severity in warnings, as some warnings are clearly more important than others, and I don't want a warning about homographs to be missed. I'm already suppressing display of grammar errors, but I am inclined to introduce 3 severity levels, abort on 1, display 2, and not display 3. |
Thank you @ReesePlews and @ronaldtse , very helpful to see the chapter and verse on this |
Want to get rid of the separate |
Severe errors in standoc:
|
Abort errors are shown in HTML log wth pink background. Severe errors are in boldface. |
Holding off on escalating style warnings in child flavours, except for IETF, which is more finicky because its output needs to be XML validated:
|
ISO 5598 is a trilingual document, English, French and German.
It happens that in English, all preferred terms are unique (due to ISO DIR 2 requirements), but in French and German, it contains multiple duplicated preferred terms.
Personally I think there are reasons for these duplications, for example, look at this:
English:
German:
If in German the words for "pressure gauge" and "pressure measuring instrument" are the same, then we are not in a position to say that is wrong.
While one might say that ISO DIR 2 says:
But if the two terms have the same domain (or no domain) in English, are we to separate into new domains in French or German?
It is a known fact that different languages expresses concepts differently. Moreover, in a dictionary one word can have multiple definitions.
This same issue also happens in ISO/IEC 2382:
Perhaps this is a common vocabulary dataset issue?
Also FYI @ReesePlews
The text was updated successfully, but these errors were encountered: