Balinese shaping seems broken #387

brawer · 2017-01-05T18:34:41Z

Compare the rendering of ᬓ᭄ᬓᬼ U+1B13 U+1B44 U+1B13 U+1B3C in HarfBuzz 1.4.0 versus macOS 10.12.2 using NotoSansBalinese-Regular.ttf:

HarfBuzz:

CoreText:

punchcutter · 2017-01-06T18:35:54Z

This is the same as seen in https://github.com/googlei18n/noto-fonts/issues/572

punchcutter · 2017-01-06T20:12:35Z

I forgot to mention if I fix the font issue the shaping still doesn't work because of the update to hb-ot-shape-complex-use.cc where now there's a decomposition for Balinese 1B3C. I don't understand what that's there for since there's no canonical decomposition and the decomposition of this should be done in the font from what I can tell. This version of the font decomposes 1B3C into 1B42 and g170 (an unencoded glyph for the bottom half), but if I switch the order to g170 1B42 then shaping works as expected.

behdad · 2017-01-09T07:54:29Z

the update to hb-ot-shape-complex-use.cc where now there's a decomposition for Balinese 1B3C

Err. That's my bad. Let me fix.

We have had added this in Indic shaper to assist shaping these scripts. In Universal Shaping Engine however, it is up to font designer to decompose them. Hence moving them from Indic shaper to USE was wrong. Fixup for f6ba63b Part of fixing #387

behdad · 2017-01-09T07:58:17Z

@punchcutter Better now?

punchcutter · 2017-01-09T17:12:52Z

This looks good on the harfbuzz side, but the font also needs to be updated from what I can tell.

behdad · 2017-01-09T22:41:21Z

Humm. Why does CoreText get it right then, any guess?

punchcutter · 2017-01-09T23:41:35Z

CoreText doesn't seem to care about the order of the marks. I tried both possible orders of decomposing 1B3C and they are both fine in CoreText, but only one order works in harfbuzz (bottom followed by top). The same for Edge on Windows 10. This attached font doesn't work in harfbuzz or Windows 10, but if the decomposition order is swapped they both work fine. 1B3C is Top_And_Bottom in IndicPositionalCategory.txt, but when split the only order that's working correctly is bottom followed by top. The top mark 1B42 is encoded and considered a top mark, but the below base half is not encoded.

We have had added this in Indic shaper to assist shaping these scripts. In Universal Shaping Engine however, it is up to font designer to decompose them. Hence moving them from Indic shaper to USE was wrong. Fixup for f6ba63b Part of fixing harfbuzz#387

brawer · 2017-01-17T15:51:38Z

Hm, should the USE spec be clearer on ordering?

harfbuzz/harfbuzz#387

punchcutter · 2017-01-17T18:53:02Z

I think the USE spec is pretty clear on ordering, but this particular situation can be a little vague because once 1B3C is decomposed the bottom half becomes an unencoded mark. If the bottom mark is interpreted as Blw then it should work according to the spec, but it looks to me like the bottom mark is being interpreted as Abv. Still it's odd that if the decomposition order is switched in the ccmp then it works even though the order is then supposedly Blw Abv which is against the USE spec. This font has no Mark Attachment classes in the GDEF, but even if I try to add these marks to Mark Attachment classes they don't seem to be interpreted in a different way.

KrasnayaPloshchad · 2017-02-03T05:13:27Z

CoreText doesn't seem to care about the order of the marks. I tried both possible orders of decomposing 1B3C and they are both fine in CoreText, but only one order works in harfbuzz (bottom followed by top).

Maybe Core Text try to give support for so-called canonically equivalents as many as possible.

behdad · 2017-07-14T15:03:24Z

Uniscribe produces same output as HarfBuzz. As such, this is text encoding problem, as well as, arguably, CoreText bug for accepting it. cc @nedley

roozbehp added Android Priority-Medium labels Jan 5, 2017

brawer added a commit to unicode-org/text-rendering-tests that referenced this issue Jan 17, 2017

Point to HarfBuzz bug 387 in description of test case “SHBALI-1”

5478d93

harfbuzz/harfbuzz#387

behdad closed this as completed Jul 14, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Balinese shaping seems broken #387

Balinese shaping seems broken #387

brawer commented Jan 5, 2017

punchcutter commented Jan 6, 2017

punchcutter commented Jan 6, 2017

behdad commented Jan 9, 2017

behdad commented Jan 9, 2017

punchcutter commented Jan 9, 2017

behdad commented Jan 9, 2017

punchcutter commented Jan 9, 2017

brawer commented Jan 17, 2017

punchcutter commented Jan 17, 2017

KrasnayaPloshchad commented Feb 3, 2017

behdad commented Jul 14, 2017

Balinese shaping seems broken #387

Balinese shaping seems broken #387

Comments

brawer commented Jan 5, 2017

punchcutter commented Jan 6, 2017

punchcutter commented Jan 6, 2017

behdad commented Jan 9, 2017

behdad commented Jan 9, 2017

punchcutter commented Jan 9, 2017

behdad commented Jan 9, 2017

punchcutter commented Jan 9, 2017

brawer commented Jan 17, 2017

punchcutter commented Jan 17, 2017

KrasnayaPloshchad commented Feb 3, 2017

behdad commented Jul 14, 2017