Enumerate through all glyphs in font #56

carlossless · 2016-11-17T14:21:19Z

Hi,

Great library. I was wondering whether it would be possible to get the full list of glyphs contained in a font, along with their Unicode representations.

As far as I understand, right now you can only look up the glyph for a single code point from the character set. It would be cool if you could pass a sequence of code points, such as U+1F468 U+200D U+1F469 U+200D U+1F467 U+200D U+1F466, and get the glyph for their combination: 👨‍👩‍👧‍👦.
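To make the request concrete, here is a minimal sketch in plain JavaScript (no font library involved) of how such a sequence forms a single string. The code points are the man, woman, girl, boy family sequence joined with U+200D (zero width joiner); the variable names are only illustrative.

```javascript
// Man, ZWJ, woman, ZWJ, girl, ZWJ, boy.
const familyCodePoints = [0x1F468, 0x200D, 0x1F469, 0x200D, 0x1F467, 0x200D, 0x1F466];

// Build one string from the whole sequence.
const family = String.fromCodePoint(...familyCodePoints);

// Iterating a string yields whole code points, not UTF-16 code units.
const roundTrip = Array.from(family, ch => ch.codePointAt(0));

console.log(family.length);    // 11 (UTF-16 code units)
console.log(roundTrip.length); // 7 (code points)
```

Whether a font renders this string as one glyph depends on its substitution tables, not on the cmap alone.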


devongovett · 2016-11-20T07:32:29Z

This is a very hard problem. It would essentially involve doing glyph substitution in reverse: from glyphs to characters instead of characters to glyphs. GSUB coverage tables may make this possible for OpenType, but it would certainly not be easy, especially when you consider the chaining substitution tables that are available. AAT uses state tables, so it would also be difficult, if not impossible, to go backward from matching states to the character sequences that might produce them. There may be an infinite number of character combinations that yield a single glyph.
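A toy illustration of what "substitution in reverse" means, assuming a made-up rule format where a sequence of glyph IDs is replaced by a single glyph ID. This is not how GSUB or AAT tables are actually encoded and not fontkit's implementation. It ignores contextual and chaining lookups entirely, and the rules and the `depth` limit are hypothetical.

```javascript
// Hypothetical ligature-style rules: `input` glyph sequence becomes `output` glyph.
const ligatureRules = [
  { input: [1, 2], output: 10 },
  { input: [10, 3], output: 11 },
  { input: [4], output: 11 },
];

// Return every glyph sequence that could end up as `glyph` by applying
// the rules backward. `depth` bounds the recursion, because cyclic rules
// would otherwise produce an unbounded number of sequences.
function sequencesForGlyph(glyph, rules, depth = 4) {
  const results = [[glyph]];
  if (depth === 0) return results;
  for (const rule of rules) {
    if (rule.output !== glyph) continue;
    // Combine the expansions of each input glyph (Cartesian product).
    let partial = [[]];
    for (const g of rule.input) {
      const expansions = sequencesForGlyph(g, rules, depth - 1);
      partial = partial.flatMap(prefix => expansions.map(e => [...prefix, ...e]));
    }
    results.push(...partial);
  }
  return results;
}

console.log(sequencesForGlyph(11, ligatureRules));
// [[11], [10, 3], [1, 2, 3], [4]]
```

Even in this simplified model, one glyph maps back to several input sequences, and each of those glyphs would still have to be mapped back to characters through the cmap.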

carlossless · 2016-11-21T09:44:57Z

Thank you for this illuminating answer. The main reason I wanted this feature was to extract all SBIX glyph PNGs and assign each one its appropriate Unicode name. I guess there are other, better ways of parsing the font to get at those resources.

Pomax · 2016-11-21T18:50:07Z

You could mine the cmap table data for that particular use case, since Unicode characters are unrelated to the visual changes that GSUB can effect. Find the coverage ranges, then run through the supported glyphs, resolving their glyph outline(s) one entry at a time.
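A minimal sketch of that idea, assuming `font` is a fontkit font object exposing `characterSet` (the code points covered by the cmap) and `glyphForCodePoint`. The function name `glyphsToCodePoints` is made up for illustration.

```javascript
// Invert the cmap: map each glyph ID to the code points that select it.
// `font` is assumed to provide `characterSet` and `glyphForCodePoint`.
function glyphsToCodePoints(font) {
  const map = new Map();
  for (const codePoint of font.characterSet) {
    const glyph = font.glyphForCodePoint(codePoint);
    if (!glyph || glyph.id === 0) continue; // skip missing glyphs and .notdef
    if (!map.has(glyph.id)) map.set(glyph.id, []);
    map.get(glyph.id).push(codePoint);
  }
  return map;
}
```

This only reaches glyphs that are mapped directly from a single code point. Glyphs produced by substitution, such as ZWJ emoji sequences, will not appear in the result.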

devongovett · 2016-11-21T18:50:54Z

See http://github.com/devongovett/apple-color-emoji

moyogo · 2016-11-21T20:47:39Z

hb_input does that (for GSUB): https://github.com/googlei18n/nototools/blob/master/nototools/hb_input.py

devongovett · 2016-12-12T09:11:02Z

Played around with this a bit this weekend. See #60. It works for AAT-based fonts like Apple Color Emoji for now. You can try it out on that branch like this (I'll add a public API at some point). It returns an array of possible strings that would produce the given glyph.

font._layoutEngine.getStringsForGlyph(1039);
// => ['\u{1F468}\u200D\u{1F469}\u200D\u{1F467}\u200D\u{1F466}']

font._layoutEngine.getStringsForGlyph(730);
// => ['\u{1F3C3}', '\u{1F3C3}\u200D\u2642', '\u{1F3C3}\u200D\u2642\uFE0F']

Will probably update my apple-color-emoji package to use this at some point. Much easier to maintain.
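For the original use case (naming extracted SBIX PNGs by their code points), a lookup like the one above could be wrapped roughly as follows. This is a sketch under assumptions: `codePointName` and `namesForAllGlyphs` are hypothetical helpers, and the lookup is passed in as a callback because `_layoutEngine.getStringsForGlyph` is a private method on a branch and may change.

```javascript
// Turn a string into a hyphen-separated list of hex code points,
// e.g. for use as a file name.
function codePointName(str) {
  return Array.from(str, ch => ch.codePointAt(0).toString(16).padStart(4, '0')).join('-');
}

// Collect candidate names for every glyph ID in [0, numGlyphs).
// `getStringsForGlyph` is a caller-supplied function returning an array
// of strings (possibly empty) for a glyph ID.
function namesForAllGlyphs(numGlyphs, getStringsForGlyph) {
  const names = new Map();
  for (let id = 0; id < numGlyphs; id++) {
    const strings = getStringsForGlyph(id) || [];
    if (strings.length > 0) names.set(id, strings.map(codePointName));
  }
  return names;
}

// Possible usage on that branch (assumption, not a public API):
// namesForAllGlyphs(font.numGlyphs, id => font._layoutEngine.getStringsForGlyph(id));
```

Since a glyph can have several producing strings, the caller still has to pick which name to use, for example the shortest one.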

devongovett mentioned this issue on Dec 12, 2016: Add support for getting string inputs from glyphs #60 (merged)
