New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Check if Traditional/Simplified errors were introduced by the update of Unihan_Variants.txt to “2021-12-01 Unicode 15.0.0 draft” #97
Comments
Oops, I left the above two comments on the wrong issue, it seems. So I deleted them. |
This one is used in Trad: https://dict.revised.moe.edu.tw/search.jsp?md=1&word=%E6%B2%84&qMd=0&qCol=1 |
The rest seem correct to me. However, there are a few characters that don't display for me. |
These are the characters that aren't displayed (i.e., I get "tofu"). |
None of these display for me either at the moment. I have not investigated yet why, maybe they are new. |
Many of those which display as Tofu are apparently not new, for example '鿰' U+9FF0 https://www.unicode.org/cgi-bin/GetUnihanData.pl?codepoint=%E9%BF%B0 says: “The Unicode Standard (Version 3.2)” i.e. it is there for a long time already. The only source given at https://www.unicode.org/cgi-bin/GetUnihanData.pl?codepoint=%E9%BF%B0 is kIRG_GSource GKJ-00201 which according to https://www.unicode.org/reports/tr38/index.html#kIRG_GSource is “GKJ Terms in Sciences and Technologies (科技用字) approved by the China National Committee for Terms in Sciences and Technologies (CNCTST)” But if none of our fonts have glyphs for that, I guess it is something really obscure, maybe nobody really uses that. |
I think I’ll fix only that one and assume that the classification of all the other characters is correct. |
Sounds reasonable. |
Here's another Trad/Simp character that's only marked as Simp: 干 |
Thank you, great! |
I made a new issue with a list of all characters in the cangjie5 table which are currently classified as simplified only. |
Builds for Fedora are here: https://copr.fedorainfracloud.org/coprs/mfabian/ibus-table/
|
c1c39a3
Regenerate engine/chinese_variants.py for Unihan_Variants.txt from “2021-12-01 Unicode 15.0.0 draft”
The text was updated successfully, but these errors were encountered: