Root (字根) issues in yustar_chaifen*.dict.yaml #44

Open
Dieken opened this issue May 25, 2024 · 2 comments
Labels
enhancement New feature or request


Dieken commented May 25, 2024

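This is the result of comparing yustar_chaifen.dict.yaml and yustar_chaifen-tw.dict.yaml with https://yuhao.forfudan.com/chaifen.csv:

  1. yustar_chaifen*.dict.yaml currently writes {虍头} as {虎头} everywhere. Please change {虎头} to {虍头} in the chaifen yaml, following chaifen.csv. The two differ visibly in their strokes, which is confusing during reverse lookup.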
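  2. For pairs such as 爪 vs 爫, 冖 vs ⺈, 卄 vs 廾, 乛 vs ㇆, 西 vs 覀, and コ vs ユ (see the counts below), the latter forms are all CJK Basic, CJK Radicals, CJK Strokes, or Katakana characters and are easy to support (confirmed to display correctly on iOS 15.8.2 with no extra fonts installed). It would be best to distinguish them in the chaifen yaml as well: the shapes differ quite a bit, so there is no need to merge them into the former.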
  3. ⺆ vs (U+E2C2 Private_Use_Area) and 𠂭 vs (U+E2C9 Private_Use_Area): the glyphs are identical. I suggest that chaifen.csv and YuhaoSongti.ttf switch to the same code points as the chaifen yaml. YuhaoSongti.ttf could add a char-to-glyph mapping for compatibility.
1248 {虎头}( U+E199 Private_Use_Area) vs 虍(U+864D CJK_Unified_Ideographs)
 168 爪 vs 爫(U+722B CJK_Unified_Ideographs)
 155 {虎头} vs 虍(U+864D CJK_Unified_Ideographs)
 145 冖 vs ⺈(U+2E88 CJK_Radicals_Supplement)
  84 卄 vs 廾(U+5EFE CJK_Unified_Ideographs)
  38 キ vs (U+E001 Private_Use_Area)
  37 二 vs (U+E24C Private_Use_Area)
  32 乛 vs ㇆(U+31C6 CJK_Strokes)
  27 ⺆ vs (U+E2C2 Private_Use_Area)
  12 西 vs 覀(U+8980 CJK_Unified_Ideographs)
  11 日 vs (U+E121 Private_Use_Area)
   8 コ vs ユ(U+30E6 Katakana)
   8 豕 vs (U+E068 Private_Use_Area)
   8 𠂭 vs (U+E2C9 Private_Use_Area)
   6 乛 vs 乙(U+4E59 CJK_Unified_Ideographs)
   6 丄 vs (U+E154 Private_Use_Area)
   5 丩 vs (U+E2E5 Private_Use_Area)
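
A tally like the one above could be produced roughly as follows. This is only a sketch, not the script actually used in this issue: the sample entries (`char_a`, `char_b`, `char_c`) are invented, the real layouts of chaifen*.dict.yaml and chaifen.csv are not assumed, and `describe` and `tally_differences` are hypothetical helper names.

```perl
use strict;
use warnings;
use utf8;

# Invented sample data: entry => list of roots, one hash per source.
# These entries are placeholders, not taken from either real file.
my %yaml_roots = (
    char_a => ['{虎头}', '几'],
    char_b => ['{虎头}', '心'],
    char_c => ['爪', '木'],
);
my %csv_roots = (
    char_a => ["\x{E199}", '几'],
    char_b => ["\x{E199}", '心'],
    char_c => ['爫', '木'],
);

# Append the code point to single-character roots; leave {named} roots alone.
sub describe {
    my ($root) = @_;
    return $root unless length($root) == 1;
    return sprintf('%s(U+%04X)', $root, ord($root));
}

# Count every position at which the two sources disagree.
sub tally_differences {
    my ($left, $right) = @_;
    my %count;
    for my $entry (sort keys %$left) {
        my $l = $left->{$entry};
        my $r = $right->{$entry} or next;   # skip entries missing on one side
        next unless @$l == @$r;             # only compare position by position
        for my $i (0 .. $#$l) {
            next if $l->[$i] eq $r->[$i];
            $count{"$l->[$i]\t$r->[$i]"}++;
        }
    }
    return \%count;
}

my $count = tally_differences(\%yaml_roots, \%csv_roots);
binmode(STDOUT, ':encoding(UTF-8)');
for my $pair (sort { $count->{$b} <=> $count->{$a} || $a cmp $b } keys %$count) {
    my ($l, $r) = split /\t/, $pair;
    printf "%4d %s vs %s\n", $count->{$pair}, $l, describe($r);
}
```

Comparing position by position keeps the sketch short, but it silently skips entries whose two decompositions have different lengths; a real comparison would need to decide how to report those.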

Dieken commented May 25, 2024

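  1. PUA U+E24C is merged into 二 in the chaifen yaml. Consider simply naming it {飞右} in the chaifen yaml instead, because some people may not have read the documentation carefully, or may have forgotten that those two dots count as 二.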
  2. PUA U+E121: {𠯮下} (𠯮 is U+20BEE, CJK Ext B, https://zi.tools/zi/𠯮) or 𫩏 (U+2BA4F, CJK Ext E, https://zi.tools/zi/𫩏)
  3. PUA U+E068: 𧰨 (U+27C28, CJK Ext B, https://zi.tools/zi/𧰨)

A few PUA characters are still genuinely needed; there is no way around them:

  1. {虍头} vs  U+E199
  2. {飞右} vs  U+E24C
  3. vs  U+E001
  4. vs  U+E154
  5. vs  U+E2E5
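
To check which PUA code points still remain in some text, something like the following might work. It is a minimal sketch: `pua_code_points` is a hypothetical helper, the sample string is invented, and the test relies on Perl's `\p{Co}` property (general category Private_Use), which also covers the supplementary private use planes, not only U+E000 to U+F8FF.

```perl
use strict;
use warnings;
use utf8;

# Return the distinct private-use code points found in $text, in ascending order.
sub pua_code_points {
    my ($text) = @_;
    my %seen;
    $seen{ ord($1) }++ while $text =~ /(\p{Co})/g;
    return map { sprintf 'U+%04X', $_ } sort { $a <=> $b } keys %seen;
}

# Invented sample line mixing PUA roots with ordinary characters.
my $sample = "\x{E199}几 \x{E24C}乙 \x{E199}心";
print join(', ', pua_code_points($sample)), "\n";   # prints "U+E199, U+E24C"
```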


Dieken commented May 26, 2024

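A more complete version. It corrects the wrong {虎头} entries; these two errors are fairly serious because even the stroke counts differ, so they cannot be described as mere shape variation.

In the image below, the first column is the PUA character used in chaifen.csv, the second column is the character expected in chaifen*.dict.yaml (using a standard CJKV character of the same shape wherever possible), and the third column is the character used in the chaifen*.dict.yaml of yustar-v3.4.5.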

image
my %roots_mapping = (
    "\x{E199}" => '{虍头}',     # not {虎头}
    '' => '',        # not {虎头}
    '' => '',        # not 爪
    '' => '',        # not 冖
    '' => '',        # not 卄
    "\x{E001}" => '𰀁',       # not キ
    "\x{E24C}" => '{飞右}',    # not 二
    '' => '',        # not 乛
    '' => '',        # not 西
    "\x{E121}" => '𫩏',       # not 日
    '' => '',        # not コ
    "\x{E068}" => '𧰨',        # not 豕
    '' => '',        # not 乛
    "\x{E154}" => '{虫下}',    # not 丄
    "\x{E2E5}" => '{𠬞左}',    # not 丩
    '' => '',        # not {高頭}
    '' => '',        # no PUA
    '' => '𰀪',       # not {两撇}
    '' => '𬺰',       # not {于下}
    '' => '𫝀',       # not {五下}
    '' => '𠀎',        # not {冓上}
    '' => '𠂎',        # not {卯左}
    '' => '𰀄',       # not {反彐}
    '' => '𰃦',       # not {向框}
    '' => '𠂇',        # not {左上}
    '' => '𠃜',       # not {眉上}
    '' => '𫶧',      # not {荒下}
    '' => '𣎆',       # not {贏頭}
    '' => '𱍸',      # not {齊右}
    '' => '𪚴',      # not {龜下}
    '' => '{𠇊右}',   # not 𧘇
    '' => '',       # not 牛
    '' => '',       # not 足
);
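
A mapping of this kind could be applied to decomposition text with a single substitution. The sketch below is an assumption about usage, not the author's actual script: it uses only three entries whose code points are stated in this thread (U+E199, U+E24C, U+E068), written as escapes so they survive copy and paste, and `normalize_roots` plus the sample input are hypothetical.

```perl
use strict;
use warnings;
use utf8;

# Subset of the mapping, keyed by explicit PUA code points.
my %roots_mapping = (
    "\x{E199}" => '{虍头}',
    "\x{E24C}" => '{飞右}',
    "\x{E068}" => '𧰨',
);

# Build one alternation from all keys so the text is scanned only once.
my $pattern = join '|', map { quotemeta($_) } sort keys %roots_mapping;

# Replace every mapped PUA character with its preferred root name or character.
sub normalize_roots {
    my ($text) = @_;
    $text =~ s/($pattern)/$roots_mapping{$1}/g;
    return $text;
}

binmode(STDOUT, ':encoding(UTF-8)');
print normalize_roots("\x{E199}几"), "\n";   # invented input; prints "{虍头}几"
```

Characters without an entry in the hash pass through unchanged, so the same function can be run over text that has already been partly converted.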

forFudan added the enhancement label on Jul 1, 2024