Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[myanmar] Reconsider punctuation_cluster #3649

Closed
behdad opened this issue Jun 10, 2022 · 3 comments
Closed

[myanmar] Reconsider punctuation_cluster #3649

behdad opened this issue Jun 10, 2022 · 3 comments
Assignees

Comments

@behdad
Copy link
Member

behdad commented Jun 10, 2022

See #3648 (comment) and discussion.

@behdad behdad self-assigned this Jun 11, 2022
behdad added a commit that referenced this issue Jun 11, 2022
Fixes #3649

This actually now allows Asat after the punctuation marks; something I see
in Wikipedia data.
behdad added a commit that referenced this issue Jun 11, 2022
Fixes #3649

This actually now allows Asat after the Myanmar punctuation marks;
something I see in Wikipedia data.
@behdad
Copy link
Member Author

behdad commented Jun 13, 2022

Should we report U+104A to Unicode? In the corpus I see both characters taking Asat. It's weird that one is categorized by Unicode and not the other.

@dscorbett
Copy link
Collaborator

Yes, if you’re sure the evidence is trustworthy, and not a typo or something like Zawgyi.

@behdad
Copy link
Member Author

behdad commented Jun 14, 2022

Yes, if you’re sure the evidence is trustworthy, and not a typo or something like Zawgyi.

I don't know that. Maybe @mhosken knows.

@behdad behdad closed this as completed in 2cbb775 Jun 15, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants