New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Combining commas above are inconsistent with those below #186
Comments
Why using simple (wedge) shapes of combining commas for Noto Serif ? IMHO:
In all these cases, we should be able to use variation selectors to select the shape for punctuation signs, but for now there's no mechanism available in Unicode to use varaition selectors for combining characters (or possibly with precomposed characters that must preserve their canonical equivalence, where it will not be obvious if the variation selector used after a precomposed characters applies to the base character or to one of its embedded diacritics: for such case a base variation selector after a precomposed character should only apply to its base, if there's such variation registered for this base, but never to its embedded diacritics; and a combining variation selector with combining class 0 should only be valid after a combining character, or after a precomposed character in which case it will alter only the embedded combining character with the highest non-zero combining class in order to preserve the canonical equivalence...).
Of course they must always be consistant between precomposed characters and combining diacritics (beside their placement or rotations for combinations used in Baltic languages (and of course the possibility to cancel such default change of position or rotation for non-Baltic languages using them at their normal position, by encoding for example a CGJ between the base letter and the diacritic, so that they are no longer canonically equivalent to the Baltic combinations which must remain consistant independantly of the fact they were encoded as precomposed characters or as base letters plus a separate diacritic). This consideration should also be applied to Roumanian-like usage of cedillas which may look as comma diacritics: such change of shape and attachment used by default for Roumanian should be disavbled as well using a CGJ to preserve the position and shape of the cedilla below, without having to use language-specific features (meaning forcing document to correctly use correct language identification in rich-text formats for multilingual documents, something that is not possible in plain-text where language identification is only applicable to the whole document (and we should not depend on the encoding of language tags, i.e. with special characters in plane 14, something that is now discouraged, never needed for rich-text formats like HTML: language-specific OpenType "feature" tables should be avoided as much as possible, given that Unicode now also supports variant selectors when needed, and the encoding and usage of variants are defined by the Unicode standard itself in the UCD). |
To summarise where we are with this: precomposed forms with comma above use a different form of comma to the comma below and combining comma above. In Serif, it's a different form; in Sans, it's a different size. A good test string for the inconsistency is ģn̦p̒: I think this is clearly a bug; unfortunately I think the fix is less clear. I personally feel like bulb-shaped commas everywhere for the serif would make sense, and maybe large comma accents everywhere in the sans would make sense, but I don't have enough design experience to know. I think for now the answer is "ask @moyogo". :-) (I don't believe that the Unicode CGJ suggestion is a good way forward; we don't really want to be innovating our own Unicode conventions. Romanian cedilla shape is already selectable in the font through setting the language tag, and yes, not many applications correctly support that (browsers do!) but that's an application issue. We do the right thing, even if they don't.) |
Original issue reported on code.google.com by
qrc...@google.com
on 7 Aug 2014 at 2:23The text was updated successfully, but these errors were encountered: