Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Browsers don't hyphenate Mongolian text #40

Open
r12a opened this issue Feb 15, 2021 · 5 comments
Open

Browsers don't hyphenate Mongolian text #40

r12a opened this issue Feb 15, 2021 · 5 comments
Labels
doc:mong gap i:hyphenation Hyphenation (a subset of Line-breaking & hyphenation) i:line_breaking Line breaking & hyphenation l:mn Mongolian p:advanced s:mong Mongolian script x:blink x:gecko x:mong x:webkit

Comments

@r12a
Copy link
Contributor

r12a commented Feb 15, 2021

Hyphenation occurs in writing Mongolian and Todo. U+1806 MONGOLIAN TODO SOFT HYPHEN is used to indicate resumption of a broken word. It functions like U+2010 HYPHEN, except that it appears at the beginning of a line rather than at the end. (Note that lines of Mongolian text are vertical, and progress from left to right.)

Specs:
issue Better describe the likely outcomes of hyphenation Open.

css-text Describes how to apply hyphenation. It makes no special mention of Mongolian, nor of which character to use and where.

css-text Has a hyphenate-character property which will allow users to specify the character to use for hyphenation, but it doesn't allow control of the location of the character.

Tests & results:

Webkit is unable to display traditional Mongolian script.

Interactive test, Mongolian text is hyphenated when hyphens:auto is set

  • Gecko:Browser: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:94.0) Gecko/20100101 Firefox/94.0
  • Blink:Browser: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/95.0.4638.69 Safari/537.36
  • Webkit: ❌ Safari doesn't display correctly. Browser: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/15.1 Safari/605.1.15

Interactive test, Mongolian adds a hyphen to the start of the second line when a word is manually hyphenated with SHY

  • Gecko: ? Produces a vertical baseline extension at the bottom of the first line. Not clear whether this is just part of the cursive glyph or a hyphen.. Browser: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:94.0) Gecko/20100101 Firefox/94.0
  • Blink: ❌ Produces a horizontal hyphen at the bottom of the first line. Browser: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/95.0.4638.69 Safari/537.36
  • Webkit: ❌ Unable to correctly display traditional Mongolian script. Browser: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/15.1 Safari/605.1.15

i18n test suite, CSS3 Text, hyphens

General tests for hyphens support. (Results may need updating.)

Browser bug reports:
GeckoBlinkWebkit

Priority:
Marked as advanced, since hyphenation is optional.

@r12a r12a added gap doc:mong i:hyphenation Hyphenation (a subset of Line-breaking & hyphenation) p:advanced labels Feb 15, 2021
@r12a
Copy link
Contributor Author

r12a commented Aug 12, 2021

The first comment in this issue contains text that will automatically appear in one or more gap-analysis documents as a subsection with the same title as this issue. Any edits made to that comment will be immediately available in the document. Proposals for changes or discussion of the content can be made in comments below this point.

Relevant gap analysis documents include:
Mongolian

@r12a
Copy link
Contributor Author

r12a commented Jan 11, 2023

I think this gap report needs to be completely rewritten. My expectation is that Mongolian words should not be split across a line, or need to use the SOFT HYPHEN. However, when compound nouns are separated by TODO SOFT HYPHEN, hyphen should move to the next line (and it has the right line-break property for that). Currently seeking clarification at #30

@r12a r12a mentioned this issue Jan 11, 2023
@asmusf
Copy link

asmusf commented Jan 12, 2023

My question: I keep hearing that there still are some open issues about the encoding model for (some aspects of) the (traditional) Mongolian script. If that assessment is correct, is it worth spending cycles on this issue? Or have those overarching issues been put to rest in the meantime?

@r12a
Copy link
Contributor Author

r12a commented Jan 12, 2023

I'm assuming that there is no connection between the handling of hyphens in Hudum and the encoding model changes (which focus on the letters).

@asmusf
Copy link

asmusf commented Jan 12, 2023

OK, but pending changes would make it difficult to have any implementation that can treat the text "correctly" in its entirety. At least not until they are settled. Just sayin'

@r12a r12a added the i:line_breaking Line breaking & hyphenation label May 1, 2024
@r12a r12a moved this to Bug in discussion in Gap-analysis pipeline Jun 20, 2024
@r12a r12a added s:mong Mongolian script l:mn Mongolian labels Jun 30, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
doc:mong gap i:hyphenation Hyphenation (a subset of Line-breaking & hyphenation) i:line_breaking Line breaking & hyphenation l:mn Mongolian p:advanced s:mong Mongolian script x:blink x:gecko x:mong x:webkit
Projects
Status: Bug in discussion
Development

No branches or pull requests

2 participants