
Classdef cache #3636

Merged
merged 6 commits into from
Jun 7, 2022
Conversation

behdad
Member

@behdad behdad commented Jun 5, 2022

[layout] Use a cache for main input ClassDef of ChainContextLookups

I expected this to speed up NotoNastaliqUrdu, but it didn't make a dent.
I need to profile on Linux to understand why. But it's mostly because
the cache is only effective when we enter the lookup from the main
loop (apply_forward), whereas NotoNastaliq mostly recurses...
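The idea can be sketched roughly as follows. This is a minimal illustration with hypothetical names (`ClassDef`, `ClassCache`), not HarfBuzz's actual implementation; the real `ClassDef.get_class` does a binary search over class ranges, which the cache short-circuits for repeated glyphs:

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Stand-in for a font's ClassDef table.  Here the class is just
// glyph % 4; in a real font get_class() is a range binary search.
struct ClassDef {
  unsigned get_class (uint32_t glyph) const { return glyph % 4; }
};

// Direct-mapped per-lookup cache of glyph -> class results.
struct ClassCache {
  static constexpr unsigned SIZE = 256;
  uint32_t keys[SIZE];
  uint8_t vals[SIZE];

  void clear () { memset (keys, 0xFF, sizeof keys); } // no valid keys

  unsigned get (const ClassDef &cd, uint32_t glyph) {
    unsigned slot = glyph & (SIZE - 1);          // direct-mapped slot
    if (keys[slot] == glyph) return vals[slot];  // cache hit
    unsigned klass = cd.get_class (glyph);       // miss: do the real lookup
    keys[slot] = glyph;
    vals[slot] = (uint8_t) klass;
    return klass;
  }
};
```

A cache like this only pays off when the same lookup keeps seeing the same glyphs, which is why entering from the main loop matters: recursion resets the locality the cache relies on.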

@behdad behdad self-assigned this Jun 5, 2022
@behdad behdad force-pushed the classdef-cache branch 2 times, most recently from 7f80147 to e374f32 Compare June 5, 2022 10:17
@behdad
Member Author

behdad commented Jun 5, 2022

The cache has a hit rate of ~10% on NotoNastaliqUrdu shaping. So I was baffled that I'm not seeing any speedup whatsoever; well, I see maybe ~1% only. But that's probably because the ClassDef2 lookups are only ~20% of shaping. So this makes sense. :'-(
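The arithmetic here is Amdahl's law: if the cached lookups are only ~20% of shaping time and the hit rate is ~10%, even free cache hits bound the overall win at about 2%. A sketch of that bound, using the numbers from this comment (the function name and the zero-cost-hit assumption are mine, for illustration):

```cpp
#include <cassert>

// Amdahl-style speedup bound.
//   lookup_fraction: share of total time spent in ClassDef lookups (~0.20)
//   hit_rate:        share of those lookups served by the cache (~0.10)
//   hit_cost_ratio:  cost of a cache hit relative to a full lookup
static double expected_speedup (double lookup_fraction,
                                double hit_rate,
                                double hit_cost_ratio)
{
  double new_lookup_time =
      lookup_fraction * ((1.0 - hit_rate) + hit_rate * hit_cost_ratio);
  double new_total = (1.0 - lookup_fraction) + new_lookup_time;
  return 1.0 / new_total;  // overall speedup factor
}
```

With `expected_speedup (0.20, 0.10, 0.0)` the bound comes out around 2%, consistent with the ~1% measured; even a perfect cache (100% hit rate) would cap out at 1/0.8 = 1.25x.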

@behdad
Member Author

behdad commented Jun 5, 2022

I see a ~2 percent speedup and ~1 percent overhead, adding up to a net ~1 percent speedup...

@behdad
Member Author

behdad commented Jun 5, 2022

Phew! Finally some progress!

Looks like NotoNastaliqUrdu spends most of its time in GPOS ContextFormat2 (dot positioning, etc.), and I was only applying the cache to ChainContextFormat2. Applying the cache to ContextFormat2 as well shows a 16% speedup in NotoNastaliqUrdu. I observe a slowdown of less than 3% in Amiri from the overhead of the new infrastructure, and much less than that in the other tests.

~~Interestingly, the Gulzar font does a lot more lookups than NotoNastaliqUrdu, but is 7 times faster. Need to investigate when I have a profiler (Linux machine) around.~~

Update: I was testing wrong. Gulzar is indeed 20 times slower than NotoNastaliqUrdu. This branch speeds up Gulzar by only 5%, because Gulzar still does a LOT of its work using ChainContextFormat3. It seems Simon's Format 3 to Format 2 converter does not do a full job yet. cc @simoncozens

@behdad behdad force-pushed the classdef-cache branch 3 times, most recently from 107cd7a to f69d227 Compare June 5, 2022 13:06
@behdad
Member Author

behdad commented Jun 5, 2022

cc @jfkthame @drott

@simoncozens
Collaborator

simoncozens commented Jun 5, 2022

I didn't use the format 3 to format 2 converter in Gulzar. I should try it.

@behdad
Member Author

behdad commented Jun 7, 2022

> I observe a slowdown of less than 3% in Amiri as the overhead of new infrastructure

Upon further measurement, the slowdown is nothing like that; that one must have been noise. I don't see any measurable slowdown whatsoever.

…ormat2

This commit adds a per-lookup caching infrastructure to GSUB/GPOS, and
uses it to cache the input ClassDef.get_class value for (Chain)ContextLookupFormat2.

For fonts that make heavy use of class-based (Format 2) context matching, this shows
a good speedup. For NotoNastaliqUrdu, for example, I observe a 17% speedup.

Unfortunately not many other lookups can use a cache like this :(.

#3636
@behdad
Member Author

behdad commented Jun 7, 2022

This caching opportunity was first pointed out to me by @mhosken many years ago.

@behdad behdad merged commit 845279c into main Jun 7, 2022
@behdad behdad deleted the classdef-cache branch June 7, 2022 15:40
behdad added a commit that referenced this pull request Jun 7, 2022
[layout-cache] Cache lookahead, not input, classdef in ChainContextFormat2

From the commit:

+    /* For ChainContextFormat2 we cache the LookaheadClassDef instead of InputClassDef.
+     * The reason is that most heavy fonts want to identify a glyph in context and apply
+     * a lookup to it. In this scenario, the length of the input sequence is one, whereas
+     * the lookahead / backtrack are typically longer.  The one glyph in input sequence is
+     * looked-up below and no input glyph is looked up in individual rules, whereas the
+     * lookahead and backtrack glyphs are tried.  Since we match lookahead before backtrack,
+     * we should cache lookahead.  This decision showed a 20% improvement in shaping of
+     * the Gulzar font.

#3636
@behdad
Member Author

behdad commented Jun 7, 2022

From last commit I pushed:

[layout-cache] Cache lookahead, not input, classdef in ChainContextFormat2

From the commit:

+    /* For ChainContextFormat2 we cache the LookaheadClassDef instead of InputClassDef.
+     * The reason is that most heavy fonts want to identify a glyph in context and apply
+     * a lookup to it. In this scenario, the length of the input sequence is one, whereas
+     * the lookahead / backtrack are typically longer.  The one glyph in input sequence is
+     * looked-up below and no input glyph is looked up in individual rules, whereas the
+     * lookahead and backtrack glyphs are tried.  Since we match lookahead before backtrack,
+     * we should cache lookahead.  This decision showed a 20% improvement in shaping of
+     * the Gulzar font.
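The rationale in the quoted comment can be made concrete by counting ClassDef queries in the common case. A sketch with hypothetical names and simplified counting (not HarfBuzz code): when the input sequence is one glyph, its class is looked up once to select the rule set, while the lookahead and backtrack classes are queried per candidate rule — so the lookahead cache absorbs far more queries:

```cpp
#include <cassert>

// Rough ClassDef query counts for a ChainContextFormat2 lookup,
// assuming `rules` candidate rules are tried against the context.
struct QueryCounts { unsigned input, lookahead, backtrack; };

static QueryCounts count_classdef_queries (unsigned rules,
                                           unsigned input_len,
                                           unsigned lookahead_len,
                                           unsigned backtrack_len)
{
  QueryCounts c;
  // First input glyph is classified once to pick the rule set;
  // rule matching re-queries only the remaining input glyphs.
  c.input = input_len + rules * (input_len - 1);
  // Lookahead is matched before backtrack, for every tried rule
  // (upper bound: backtrack is only reached if lookahead matches).
  c.lookahead = rules * lookahead_len;
  c.backtrack = rules * backtrack_len;
  return c;
}
```

With an input length of 1, a lookahead of 3, and 10 rules tried, the input side costs 1 query while the lookahead side costs 30 — which is the scenario the commit describes for Gulzar-style "identify a glyph in context" lookups.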
