Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[USE] Several characters are in two USE classes each #475

Open
NorbertLindenberg opened this issue Aug 8, 2020 · 2 comments
Open

[USE] Several characters are in two USE classes each #475

NorbertLindenberg opened this issue Aug 8, 2020 · 2 comments
Labels
USE USE spec

Comments

@NorbertLindenberg
Copy link

U+00A0 has general category Zs, which puts it into USE class OTHER. It has Indic syllabic category Consonant_Placeholder, which puts it into USE class BASE_OTHER.

It should be BASE_OTHER, as the Unicode Standard recommends this character as the base to show nonspacing combining marks in isolation.


Document Details

Do not edit this section. It is required for docs.microsoft.com ➟ GitHub issue linking.

@xadxura xadxura added the USE USE spec label Aug 13, 2020
@Richard57
Copy link

Given that Notepad on Windows 10 at least (but not Word of Microsoft Office Standard 2016 on Windows 10) is treating it as BASE_OTHER, I suggest the second line for the O/OTHER derivation be changed from "White space characters, UGC=Zs" to "This includes white space characters, UGC=Zs, except for U+00A0", or simply deleted as redundant. Failing that, mention that U+00A0 is an exception.

@NorbertLindenberg NorbertLindenberg changed the title [USE] U+00A0 is in two USE classes [USE] Several characters are in two USE classes each Sep 30, 2023
@NorbertLindenberg
Copy link
Author

Several other characters also are in two USE classes each:

U+0F01: InSC-A=Consonant → BASE; gc=So → SYM
U+0F04..U+0F06: InSC-A=Consonant → BASE; gc=Po → BASE_IND
U+104B: InSC=Consonant_Placeholder → BASE_OTHER; gc=Po → BASE_IND
U+1800, U+1807, U+180A: InSC-A=Consonant → BASE; gc=Po → BASE_IND
U+200D: jt=C → BASE; InSC=Joiner → Zero width joiner
U+11A3F, U+11A45: InSC=Consonant_Placeholder → BASE_OTHER ; gc=Po → BASE_IND
U+1E14F: InSC-A=Consonant → BASE; gc=So → SYM

InSC-A here means Indic_Syllabic_Category Additional.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
USE USE spec
Projects
None yet
Development

No branches or pull requests

3 participants