-
Notifications
You must be signed in to change notification settings - Fork 452
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[subset] add an option to exclude glyphs specific to 'lang/script' #244
Comments
For the record, the two font files I tested are NotoSansCJKkr-Regular.otf and NotoSansKR-Regular.otf (1.002 version) Below is the 'ttx -l' output of the subsetted result from the 1st font.
The differences in CFF and GSUB can be explained by the former including all the variant (non-default) glyphs for CJK ideographs while the latter not doing so. It'll be taken care of by taking care of this issue. The difference in the size of cmap is a bit puzzling (89kB vs 75kB). When I dumpled the cmap table from both subsetted output and diff'd them, the only difference is as below. There's no character repertoire difference (as expected because I used the same character list to include), but 'length' are different in two cmap tables in the dump. I guess this should be a separate issue (cmap optimization). I'll file a new issue on this.
|
So, the glyph order is exactly the same? |
I was using subset.py to subset NotoSansCJKkr and NotoSansKR.
The former is the full repertoire Noto Sans CJK with the default glyphs set to Korean variants and non-Korean glyph variants accessible via locl. It's about 16MB.
The latter is a Korean specific subset (both in terms of character repertoire and glyph repertoire) and does not have non-Korean glyph variants for CJK ideographs. Its size is about 5MB.
I specified the exact same subset (of Unicode code points) and started with the above two original fonts.
The results are different mainly because the result of subsetting from the first keeps all the non-Korean variant glyphs for CJK ideographs while the subset from the second does not.
I propose to add an option to either exclude glyphs specific to 'language/scripts' (negative list) or only include glyphs specific to lang-system specified + default (positive list).
The text was updated successfully, but these errors were encountered: