Review Segment Break Transformation Rules (CSS Text Level 3) #211

kidayasuo · 2020-05-08T23:09:28Z

There are discussions in CSS WG regarding Segment Break Transformation Rules:

https://drafts.csswg.org/css-text-3/#line-break-transform (what it is and the rule)
https://drafts.csswg.org/css-text-3/#space-discard-set (current charset)
[css-text-3] Segment Break Transformation Rules for East Asian Width property of A csswg-drafts#337 (what do do with EAW=Ambiguous? closed)
[css-text-3] Should enclosed ideographic blocks be space-discarding? csswg-drafts#4992 (how about enclosed ideographic?)
Space between characters after joining two lines clreq#293 (corresponding issue in clreq. good illustration of what the Segment Break Transformation Rules is)
https://www.w3.org/TR/jlreq/ja/#character_classes (JLReq charsets)

We would like to review the rule to see if there are any remaining issues or areas which need discussions.

kidayasuo · 2020-05-09T02:33:16Z

[updated] Updated the data by removing ones that are actually fullwidth versions of the character, and by removing character classes that are inherently non-Japanese (cl-24-cl-27). It makes the list easier to examine.

List of characters listed in JLReq that are not Space Discarding according to https://drafts.csswg.org/css-text-3/#space-discard-set

NOT_SpaceDiscarding_JLReq_char.txt

xfq · 2020-05-09T03:59:55Z

There's also w3c/csswg-drafts#5017 , which is the new CSS issue for "ambiguous" characters.

kojiishi · 2020-05-09T08:25:12Z

The list is very much helpful, thank you very much, @kidayasuo! It looks to me that the list is reasonable; i.e., the current set of space-discarding unicode characters is reasonable from JLREQ perspective. /cc @fantasai

kidayasuo · 2020-05-09T11:31:54Z

A basic, but fundamental question. How much we can expect authors or editor software, if they fold line automatically, to corporate? In one extreme, we could say to CJ authors to fold lines only between two Kanjis. then we do not need any other rules than "the segment break transformation rule will not insert a space between two Kanjis". (also, probably these expectations should be documented)

kojiishi · 2020-05-11T19:54:39Z

A basic, but fundamental question...

I think that is exactly where this is controversial. I'm in favor of making rules as simple as possible, because no matter what we do, authors must remember all the rules, and adopt to it. @r12a seems to have similar opinion if I understand his comment correctly. I see some people arguing more rules can make it smarter. I agree they help some cases but authors must remember more.

kidayasuo · 2020-05-12T00:26:21Z

Thank you. I agree we should make the rule easier to remember, in another word intuitive. It also needs to be reliable and in that sense I am not so much fond of language tagging idea because it is more prone to errors.

One little caution is that, in general, things that look simpler for human and easier to remember does not necessarily match something that is simple for rule makers. I think we should strive to devise a "smart" rules that feels simpler to people or our users.

MurakamiShinyu mentioned this issue May 19, 2020

[css-text-3] Segment Break Transformation Rules around CJK Punctuation w3c/csswg-drafts#5086

Open

himorin mentioned this issue Jan 5, 2021

update descriptions in this repository #249

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Review Segment Break Transformation Rules (CSS Text Level 3) #211

Review Segment Break Transformation Rules (CSS Text Level 3) #211

kidayasuo commented May 8, 2020

kidayasuo commented May 9, 2020 •

edited

Loading

xfq commented May 9, 2020

kojiishi commented May 9, 2020

kidayasuo commented May 9, 2020

kojiishi commented May 11, 2020

kidayasuo commented May 12, 2020

Review Segment Break Transformation Rules (CSS Text Level 3) #211

Review Segment Break Transformation Rules (CSS Text Level 3) #211

Comments

kidayasuo commented May 8, 2020

kidayasuo commented May 9, 2020 • edited Loading

xfq commented May 9, 2020

kojiishi commented May 9, 2020

kidayasuo commented May 9, 2020

kojiishi commented May 11, 2020

kidayasuo commented May 12, 2020

kidayasuo commented May 9, 2020 •

edited

Loading