Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[css-text][text-spacing] Add black square & white square characters to the definition of ideographs? #9603

Open
xfq opened this issue Nov 16, 2023 · 6 comments
Labels
css-text-4 i18n-clreq Chinese language enablement i18n-jlreq Japanese language enablement i18n-tracker Group bringing to attention of Internationalization, or tracked by i18n but not needing response.

Comments

@xfq
Copy link
Member

xfq commented Nov 16, 2023

https://drafts.csswg.org/css-text-4/#ideographs

U+25A0 and U+25A1 can be used to represent a missing ideograph, and should be treated in the same way as Han characters in Chinese contexts. I'm not quite sure how these two characters are usually used in other contexts, though.

1

Example usage:

2

@xfq xfq added css-text-4 i18n-jlreq Japanese language enablement i18n-clreq Chinese language enablement i18n-tracker Group bringing to attention of Internationalization, or tracked by i18n but not needing response. labels Nov 16, 2023
@macnmm
Copy link

macnmm commented Nov 18, 2023

Since these characters exist in both cjk fonts and Latin fonts, they cannot be treated consistently based on code point alone. When they are full width and in a cjk font, they should be treated as ideographic in nature and get mojikumi spacing accordingly as well as be set upright in vertical text. Otherwise they should be treated as Latin characters.

@kidayasuo
Copy link

I think these would belong to a class that is neutral, i.e. a class which does not create an extra space with any characters.

@kidayasuo
Copy link

@xfq , how common is that usage? When one character is used in multiple ways (and glyphs) it makes things complicated…

@xfq
Copy link
Member Author

xfq commented Nov 22, 2023

This symbol is relatively common in the digitization of ancient Chinese books, especially the white square. It's used to indicate characters that are not legible.

@kidayasuo
Copy link

kidayasuo commented Nov 22, 2023

Thank you. Then I agree they should not make a space between other ideographs.

Can they be in a class that does not create space with any characters? If so we can treat them consistently with other symbols in the same way. If not we would need to special case these symbols, but when they are not used as a replacement character the behavior might look odd, i.e. among many similar symbols only these two make space with Latin characters and numbers.

@fantasai fantasai changed the title [css-text] Add black square & white square characters to the definition of ideographs? [css-text][text-spacing] Add black square & white square characters to the definition of ideographs? Jan 9, 2024
@macnmm
Copy link

macnmm commented Mar 19, 2024

Thank you. Then I agree they should not make a space between other ideographs.

Can they be in a class that does not create space with any characters? If so we can treat them consistently with other symbols in the same way. If not we would need to special case these symbols, but when they are not used as a replacement character the behavior might look odd, i.e. among many similar symbols only these two make space with Latin characters and numbers.

In my experience, there is always some pairing that requires some space or requires spacing logic beyond "do nothing". Consider full justification expansion, where your "do nothing" class could get no spacing where all other ideographs get 0-100% spacing before the H&J violation state kicks in. Therefore, I think all glyphs/characters need assignment to a class, and that class should be part of the spacing rules table at all times.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
css-text-4 i18n-clreq Chinese language enablement i18n-jlreq Japanese language enablement i18n-tracker Group bringing to attention of Internationalization, or tracked by i18n but not needing response.
Projects
None yet
Development

No branches or pull requests

3 participants