Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Break anywhere fails on conjuncts #78

Open
r12a opened this issue Feb 4, 2020 · 1 comment
Open

Break anywhere fails on conjuncts #78

r12a opened this issue Feb 4, 2020 · 1 comment
Labels
doc:taml gap i:line_breaking Line breaking & hyphenation l:ta Tamil language & script p:advanced s:taml Tamil script x:taml

Comments

@r12a
Copy link
Contributor

r12a commented Feb 4, 2020

line-break:anywhere causes lines to break inside words. It should break lines on grapheme cluster boundaries for all consonant clusters apart from the 3 special conjuncts. Chrome doesn't support the anywhere value. Firefox and Safari behave as expected.

The exceptions are the sequences க்ஷ k͓ʂ, and ஶ்ரீ ʃ͓ɾī / ஸ்ரீ s͓ɾī (which are synonyms). These sequences should not be broken during line breaking. Correct line breaking of these conjunct-forming sequences are not supported by default by Unicode grapheme clusters (which split them in two), and requires the application of tailored rules.

Test: line-break:anywhere should not break shri or ksha conjuncts.

Firefox is ok for shri and for ksha without a vowel-sign, but in ரிக்ஷா leaves க் on previous line, still shaped for half a conjunct. Safari is ok for shri in HTML, but leaves ஸ் behind in textarea; for ksha, in textarea leaves க் behind, in HTML initially moves whole word to next line, then puts ரி back at end of line as you decrease the window width.

Similar results are produced for word-break: break-all, except that Chrome supports this property and value. Chrome wraps ஸ்ரீந as a single unit and ரிக்ஷா as a unit.

The impact of this is advanced, although it would be good to fix it.

@r12a r12a added i:line_breaking Line breaking & hyphenation gap p:advanced doc:taml labels Feb 4, 2020
@r12a
Copy link
Contributor Author

r12a commented Feb 4, 2020

The first comment in this issue contains text that will automatically appear in the Tamil gap-analysis document as a subsection with the same title as this issue. Any edits made to that comment will be immediately available in the document. Proposals for changes or discussion of the content can be made in comments below this point.

@r12a r12a added the x:taml label May 18, 2021
@r12a r12a added the l:ta Tamil language & script label May 1, 2024
@r12a r12a added the s:taml Tamil script label Jul 2, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
doc:taml gap i:line_breaking Line breaking & hyphenation l:ta Tamil language & script p:advanced s:taml Tamil script x:taml
Projects
Status: Issue identified, needing investigation
Development

No branches or pull requests

1 participant