Unicode values outside Basic Multilingual Plane cannot be processed correctly. #168

lixiangliu · 2015-12-12T13:31:33Z

For example:

font.stringToGlyphs('\u{2f894}') returns an array containing two “.notdef” glyphs.

If font.glyphs.glyphs[...] is used to access that glyph directly, it returns an object whose unicode value is undefined.
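Getting two glyphs back for one character is consistent with the string being handled as UTF-16 code units. A minimal sketch in plain JavaScript (no opentype.js calls; `codeUnits` is a hypothetical helper name used only for illustration) of what a `charCodeAt()`-based loop sees for this character:

```javascript
// U+2F894 lies outside the BMP, so a JavaScript string stores it as a
// surrogate pair: two 16-bit code units.
const text = '\u{2f894}';

// Hypothetical helper (not part of opentype.js): list the UTF-16 code
// units of a string, the way a charCodeAt()-based loop would see them.
function codeUnits(str) {
  const units = [];
  for (let i = 0; i < str.length; i += 1) {
    units.push(str.charCodeAt(i));
  }
  return units;
}

const units = codeUnits(text);
console.log(text.length);                       // 2
console.log(units.map((u) => u.toString(16)));  // [ 'd87e', 'dc94' ]
```

Neither 0xD87E nor 0xDC94 is a character on its own, so a per-code-unit lookup would plausibly find no glyph for either half.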


lixiangliu · 2015-12-14T02:43:52Z

It seems this is a limitation of the 16-bit mapping in cmap format 4.

fdb · 2015-12-15T10:08:11Z

Would it help if we parsed a different cmap format?

lixiangliu · 2015-12-15T10:28:15Z

In that case, cmap format 12 is probably a good choice: it’s widely adopted and is also a superset of format 4.
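For context, format 12 stores sequential map groups with 32-bit character codes, each defined by a start code, an end code and a start glyph ID. Below is a rough sketch of such a lookup, assuming the groups have already been parsed into plain objects sorted by start code. The `groups` data and the `lookupGlyphIndex` name are made up for illustration and are not opentype.js's parser or API.

```javascript
// Hypothetical, hand-written group data (not taken from any real font).
// Each group maps the range [startCharCode, endCharCode] to consecutive
// glyph IDs beginning at startGlyphID.
const groups = [
  { startCharCode: 0x41, endCharCode: 0x5a, startGlyphID: 10 },
  { startCharCode: 0x2f800, endCharCode: 0x2fa1d, startGlyphID: 500 },
];

// Binary search over the sorted groups; returns 0 (.notdef) when the
// code point is not covered by any group.
function lookupGlyphIndex(sortedGroups, codePoint) {
  let lo = 0;
  let hi = sortedGroups.length - 1;
  while (lo <= hi) {
    const mid = (lo + hi) >> 1;
    const group = sortedGroups[mid];
    if (codePoint < group.startCharCode) {
      hi = mid - 1;
    } else if (codePoint > group.endCharCode) {
      lo = mid + 1;
    } else {
      return group.startGlyphID + (codePoint - group.startCharCode);
    }
  }
  return 0;
}

console.log(lookupGlyphIndex(groups, 0x41));     // 10
console.log(lookupGlyphIndex(groups, 0x2f894));  // 648
console.log(lookupGlyphIndex(groups, 0x61));     // 0
```

Because the character codes are 32-bit, a code point such as 0x2F894 can be looked up directly, which a 16-bit format 4 table cannot express.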

Jolg42 · 2016-07-29T15:56:33Z

@lixiangliu cmap 12 read support was just added with PR #207 😉

fdb · 2016-07-31T09:18:24Z

Closing this issue because of the cmap 12 support. @lixiangliu let me know if there's something else that blocks you from working with these fonts.

amart1n · 2016-12-05T09:45:14Z

The same issue still exists with values outside the BMP. Looks like it may be because of the limitations of charCodeAt() https://mathiasbynens.be/notes/javascript-unicode#code-point-from-string

Jolg42 · 2016-12-05T10:47:43Z

@amart1n You're right, charCodeAt() "will always return a value that is less than 65536." (ref)

codePointAt() is the newer method to use, but for compatibility with environments that lack it we would need to add Mathias's polyfill (https://github.com/mathiasbynens/String.prototype.codePointAt) and use it instead of charCodeAt() to get full Unicode support.
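A minimal sketch of that idea, assuming `String.prototype.codePointAt` is available (natively or through the polyfill). The `codePoints` helper is hypothetical and is not necessarily how the eventual fix was implemented:

```javascript
// Hypothetical helper: walk a string by code point instead of by
// UTF-16 code unit, so non-BMP characters come back as single values.
function codePoints(str) {
  const result = [];
  for (let i = 0; i < str.length; i += 1) {
    const cp = str.codePointAt(i);
    result.push(cp);
    // A code point above 0xFFFF occupies two code units; skip the
    // trailing (low) surrogate so it is not reported separately.
    if (cp > 0xffff) {
      i += 1;
    }
  }
  return result;
}

const sample = 'A\u{2f894}';
console.log(sample.charCodeAt(1).toString(16));               // 'd87e'
console.log(codePoints(sample).map((c) => c.toString(16)));   // [ '41', '2f894' ]
```

With the code points in hand, each one can be mapped to a glyph index in a single lookup rather than one lookup per surrogate half.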

fpirsch · 2018-03-19T22:30:36Z

Closed through #338

lixiangliu changed the title ~~The unicode value outside Basic Multilingual Plane cannot be processed correctly.~~ Unicode values outside Basic Multilingual Plane cannot be processed correctly. Dec 12, 2015

fdb closed this as completed Jul 31, 2016

Jolg42 reopened this Dec 5, 2016

Jolg42 mentioned this issue Jun 14, 2017

Add support for openType SVG color fonts #193

Closed

talltyler mentioned this issue Jun 16, 2017

Addressing issues #168 related to not supporting multi byte characters #297

Closed

8 tasks

Jolg42 mentioned this issue Dec 2, 2017

Make cmap format 12 if needed #315

Merged

This was referenced Mar 13, 2018

Support UTF-16 surrogate pairs #331

Closed

Support non-Basic-Multilingual-Plane (BMP) characters #338

Merged

fpirsch closed this as completed Mar 19, 2018
