HK glyph issues for characters with “番” #214

tamcy · 2018-12-04T10:11:25Z

Similar to #213 , this one is about how strokes of the standard glyph is interpreted, and the characters reported points to a larger issue.

First, the following 5 characters needs attention:

Codepoint
U+5BE9 審
U+64AD 播
U+65DB 旛
U+6A4E 橎
U+700B 瀋

The bottom right stroke of 釆 is different from CN. HK could use the glyphs from TW, but I also found it is intentional to use a separate glyph set for HK because of a design difference from TW:

Looks like the interpretation is that the center vertical stroke of 釆 in TW should be joined to the center vertical stroke of 田, as if they are one single stroke. HK, similar to JP/KR/CN, uses a disjoint version.

Seems the decision is based on TW’s standard document. In 宋體 and 方體 (No. 102651) masters, the center vertical strokes do look joined.

In this case, 5 new HK glyphs would be needed for the above characters.

But I’d argue that the disjoint version could also be used for TW, and it's a design but not standard difference:

The stroke-order document clearly shows that 釆 and 田 are written separately.
You can install the 宋體 SongTi, a.k.a. Serif/Direct Link and 楷書 Kaishu a.k.a. Script/Direct Link font files supplied by MoE to inspect the actual structure. The vertical strokes are are actually not joined.

I believe you are already aware about this, as Source Han Serif also uses a disjoint version.

There're also glyphs with disjoint version in Big5 and HKSCS range due to glyph sharing. For instance, U+52EB 勫, U+9131 鄱 and U+98DC 飜.

In contrast, TW glyph of U+7FFB 翻 (joined version) is used by HK.

Honestly the issue seems subtle to me. Personally I'd like to see it unified (such that the disjoint version is used). I can provide a comprehensive list of glyphs concerned if you are to take action on this issue.

hfhchan · 2018-12-04T13:11:18Z

Nit: While at it, it would be better if the second and third stroke for 釆 was handled consistently across different regions, since there is glyph sharing.

A while ago I also made a request to CLIAC to make the strokes more explictly disjointed in HKSCS-2016, which is why they look different from their Taiwanese cousins.

kenlunde · 2019-02-16T14:36:18Z

@tamcy I am thinking to unify these forms for TW and HK, using the disjoint form of 番, so please send to me—or add to this issue—a comprehensive list of affected Big Five and HKSCS code points at your earliest convenience. It seems that most of these will be TW glyphs that need 釆 and 田 to be separated.

For U+5E61 幡, the HK glyph will change identity to become the TW glyph, which will be shared by TW and HK.

tamcy · 2019-02-17T04:52:26Z

Sure. Here lists the characters with the 番 component, categorized into 5 tables.

Actions would be needed for characters in Table A-C. Table D-E are fine, and are included for completeness. (Table A-D are all Big5 codepoints, characters in table E are in HKSCS)

Table A (22 characters)

Table A character list

In this table, different glyphs are used for HK and TW, but their only difference is whether the center vertical lines of 番 appears joined.

For characters of this type, I believe your workflow would be renaming the HK glyphs to TW and have them shared among HK and TW.

Codepoint
U+50E0 僠
U+58A6 墦
U+5B0F 嬏
U+5B38 嬸
U+5D93 嶓
U+5E61 幡
U+71D4 燔
U+756A 番
U+78FB 磻
U+7C53 籓
U+7E59 繙
U+7FB3 羳
U+81B0 膰
U+8543 蕃
U+85E9 藩
U+87E0 蟠
U+8B52 譒
U+8B85 讅
U+8E6F 蹯
U+8F53 轓
U+9407 鐇
U+9C55 鱕

Table B (9 characters)

Table B character list

The TW glyphs in this table needs to be modified to have the vertical strokes disjointed. Possible additional actions are on the second column.

Codepoint	Additonal action/remarks
U+5BE9 審	Also remap HK to TW
U+64AD 播	Also remap HK to TW
U+65DB 旛	Also remap HK to TW
U+6A4E 橎	Also remap HK to TW
U+6F58 潘	Also remap HK to TW
U+700B 瀋	Also remap HK to TW
U+74A0 璠	Also remap HK to TW
U+76A4 皤	Also remap HK to TW
U+7FFB 翻	HK already mapped to TW. This font intentionally uses a different design of 羽 for CN and TW/HK, with only a few exceptions. Otherwise the CN glyph could be used for HK and TW.

Table C (1 character)

Table C character list

For this single character, CN glyph can be used for TW. The TW glyph can be removed.

Codepoint
U+89BE 覾

Table D (3 characters)

Table D character list (PDF)

HK and TW are shared with other regions, no action needed.

Codepoint	Remarks
U+52EB 勫	Mapped to CN
U+9131 鄱	Mapped to CN
U+9DED 鷭	Mapped to JP

Table E (5 characters)

Table E character list (PDF)

Characters in the last table are HKSCS characters. They only have glyphs dedicated to HK, so no action is neccessary.

Codepoint
U+4AA4 䪤
U+5643 噃
U+98DC 飜
U+210D3 𡃓
U+24ABA 𤪺

That's all.

kenlunde · 2019-02-17T14:51:15Z

@tamcy: Thank you. The action for Table C is now reflected in the table at the beginning of Issue #203. Tables D and E require no action, as you stated. I appreciate the completeness. I will record the actions for Tables A and B in the appropriate consolidated issues later today, then will close this issue.

kenlunde · 2019-02-17T15:57:40Z

The actions for Tables A and B are now reflected in the tables at the beginning of Issues #202, #204, and #207.

kenlunde · 2019-02-17T16:45:18Z

I just changed the action for the glyphs in Table A. The TW CMap will be changed so that the affected code points map to the HK glyph (see Issue #202), and it won't be until Version 3.000 that the HK glyphs will be renamed to become TW glyphs (see Issue #227), meaning that TW and HK will share the TW glyphs. In other words, the HK glyphs will be unused from the upcoming dot release.

kenlunde mentioned this issue Feb 17, 2019

Consolidation of Post-V2 Glyph Sharing Suggestions #203

Closed

This was referenced Feb 17, 2019

Consolidation of Miscellaneous Post-V2 Changes #207

Closed

Consolidation of Post-V2 Mapping Change Suggestions #202

Open

Consolidation of Post-V2 Glyph Correction Suggestions #204

Open

kenlunde closed this as completed Feb 17, 2019

kenlunde mentioned this issue Feb 17, 2019

Consolidation of V3 Glyph Sharing Suggestions #227

Open

kenlunde self-assigned this Feb 19, 2019

kenlunde added the consolidated label Feb 19, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HK glyph issues for characters with “番” #214

HK glyph issues for characters with “番” #214

tamcy commented Dec 4, 2018

hfhchan commented Dec 4, 2018

kenlunde commented Feb 16, 2019

tamcy commented Feb 17, 2019 •

edited

Loading

kenlunde commented Feb 17, 2019

kenlunde commented Feb 17, 2019

kenlunde commented Feb 17, 2019

HK glyph issues for characters with “番” #214

HK glyph issues for characters with “番” #214

Comments

tamcy commented Dec 4, 2018

hfhchan commented Dec 4, 2018

kenlunde commented Feb 16, 2019

tamcy commented Feb 17, 2019 • edited Loading

Table A (22 characters)

Table B (9 characters)

Table C (1 character)

Table D (3 characters)

Table E (5 characters)

kenlunde commented Feb 17, 2019

kenlunde commented Feb 17, 2019

kenlunde commented Feb 17, 2019

tamcy commented Feb 17, 2019 •

edited

Loading