Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dandas are wrapped alone to the beginning of a line #105

Closed
r12a opened this issue Feb 5, 2020 · 4 comments
Closed

Dandas are wrapped alone to the beginning of a line #105

r12a opened this issue Feb 5, 2020 · 4 comments
Labels
doc:gujr gap i:segmentation Grapheme/word segmentation & selection p:basic

Comments

@r12a
Copy link
Contributor

r12a commented Feb 5, 2020

Gujarati uses full stop to represent the end of the sentence. However if the user wants to use the danda or double danda, as per to Unicode recommendation, they have to come from Devanagari block of Unicode. Devanagari phrase separator । U+0964 DEVANAGARI DANDA or ॥ U+0965 DEVANAGARI DOUBLE DANDA are encoded in the DEVANAGARI block with the intent that they should be used as common punctuation for all the major scripts of India including Gujarati.

The properties of purna viram and deergh viram should be the same as the properties of FullStop or other punctuation marks, and a new line should not begin with purna viram and deergh viram.

@r12a r12a added i:segmentation Grapheme/word segmentation & selection gap p:basic doc:gujr labels Feb 5, 2020
@r12a
Copy link
Contributor Author

r12a commented Feb 5, 2020

The first comment in this issue contains text that will automatically appear in the Gujurati gap-analysis document as a subsection with the same title as this issue. Any edits made to that comment will be immediately available in the document. Proposals for changes or discussion of the content can be made in comments below this point.

@fantasai
Copy link

Seems like a UAX14 issue. Has it been reported to Unicode?

@lianghai
Copy link

Dandas are already Break After (BA) in UAX #14 (see section “Dandas”), the class for general closing characters.

A long-standing issue though, is if dandas should be further specified to behave like Exclamation/Interrogation (EX) so a preceding space (it’s a common style to surround dandas with a pair of spaces, like the French way of typesetting question and exclamation marks) doesn’t cause a linebreak either. This issue ticket may be actually referring to that situation.

@r12a
Copy link
Contributor Author

r12a commented May 18, 2021

Closed in favour of #88

@r12a r12a closed this as completed May 18, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
doc:gujr gap i:segmentation Grapheme/word segmentation & selection p:basic
Projects
None yet
Development

No branches or pull requests

3 participants