IFTB draft #151

Draft: skef wants to merge 5 commits into main

Conversation

@skef (Contributor) commented Aug 22, 2023

I went through my IFTB commits from earlier this year and reorganized them against the current main branch. Unfortunately the result doesn't quite build yet, I think because the separate IFTB document (which does build) isn't in W3C's bibliography.

A few notes:

  1. I left the rangefile documented even though I think we should remove it, because I figure the documentation should generally track the reference implementation.
  2. Some sections in Overview.bs have "XXX" notes indicating further changes that need to be made.
  3. As I noted earlier this is all very rough.


@svgeesus (Contributor):

Hi @skef, could you please go to your W3C account and link your GitHub account to it? That enables the IPR bot to recognize you. Thanks!

@garretrieger (Contributor):

Here are some high-level thoughts; we can likely discuss a lot of this during the upcoming TPAC meeting:

  1. It’s likely that the memory usage for the unpacked font will be quite high (e.g. the Noto Serif SC base file is ~1 MB uncompressed). This is partially because the format retains GIDs and ends up with lots of zeroed-out regions for glyphs that aren’t yet present. I wonder if there are some ways that we can avoid doing this? I suspect browsers are going to be pretty concerned about overall memory usage.

    a. Related to this, there’s currently a requirement to change the first 4 bytes of the version tag on the font file before passing the font to a rendering process. This may end up necessitating a full copy of the font’s data prior to passing the font on to the rendering process, further increasing memory usage. We might consider relaxing this requirement if the rendering process is aware of the IFTB version tag and capable of correctly processing the file.

  2. The initial file can be large (even after compression, I saw 200k-400k in the fonts that are part of the demo). Part of this is because all non-glyph data is transferred in full as part of the initial file. I wonder if it might make sense to have data from additional tables that are indexed by glyph ID (for example vmtx/hmtx) sent in the chunk files. See a breakdown of table sizes in the initial file here: https://docs.google.com/spreadsheets/d/1AK43VvY4LAEuttNPLoXulmedjRs8y44A4ZtN0lcT2QU/edit?resourcekey=0-uRS36zkMiyBmi8fG_U70ZA#gid=0

  3. Font collections: when developing patch subset a concern was raised about making sure it supported font collections. So we’ll possibly need to think about what it would take to support font collections in IFTB.

  4. File extensions: would it make sense to have recommended file extensions for the various new file formats (instead of reusing .otf and .woff2)?

    a. Relatedly, we almost certainly want to introduce a new format(...) for IFTB files in @font-face rules. This would allow for feature detection in the browser (e.g. a client that doesn’t understand IFTB files will skip loading and fall back to the next URL; see the sketch after this list).

  5. The feature chunk mapping mechanism allows one to map from (feature, chunk idx) -> additional chunk idx. However, this is a bit limiting. If we had the ability to map from (feature, {chunk idx set}) -> additional chunk idx, it would be possible to conditionally include glyphs that are only needed when combinations of chunks are present (e.g. a ligature that can only be activated when glyphs from two different chunks are available). This would be optional; a simpler encoder could simply not utilize it (sticking with the simpler (feature, chunk idx) approach). An advanced encoder could likely utilize this to produce more granular chunks when encountering multi-glyph substitution rules.

  6. In the patch subset specification, when referring to integration with browsers, we did so by referencing the Fetch spec (e.g. to describe how to make a request). In this spec, since the integration with the browser is a lot simpler, you might be able to get away with not referencing browsers at all: just focus on describing the operations that a client could do (initial request, extending the font via loading chunks).
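
As an illustration of the feature-detection idea in 4a, a @font-face rule might look something like this (a sketch; the "iftb" format identifier is a placeholder, not a name anyone has agreed on):

```css
@font-face {
  font-family: "Noto Serif SC";
  src: url("noto-serif-sc.iftb") format("iftb"),   /* hypothetical IFTB format name */
       url("noto-serif-sc.woff2") format("woff2"); /* clients without IFTB support fall back here */
}
```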

IFTB.bs Outdated
TR: https://www.w3.org/TR/IFTB/
ED: https://w3c.github.io/IFT/IFTB.html
Editor: Chris Lilley, W3C, https://svgees.us/, w3cid 1438
Editor: Myles C. Maxfield, Apple Inc., mmaxfield@apple.com, w3cid 77180
Contributor:

You should remove Myles and add yourself as an editor.

IFTB.bs Outdated
"chunk set" (which could be a bitmap or std::vector<bool> indexed by chunk index.
5. The browser then look up each layout feature in the font subset description in the IFTB table
featureTable. That table maps the initial GID-mapped chunks to higher-indexed feature-specific
chunks. If any chunk in the set maps to a feature-specific-chunk the latter is added to the set.
Contributor:

Can you clarify here if chunks that are added by a feature are allowed to trigger further additions on later features? This process will need to be very precisely defined so that multiple implementations will always arrive at the same result.
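
One possible way to make this precise would be a fixed-point iteration: keep applying the feature map until no new chunks are added, so later feature lookups do see chunks added by earlier ones. A sketch in Python (the flat (feature, chunk) -> chunk dictionary here is an illustrative stand-in for the featureTable, not its actual layout):

```python
def expand_chunk_set(chunk_set, feature_map, active_features):
    """Apply the feature map repeatedly until the chunk set stops growing.

    chunk_set: set of chunk indices required by the GID map
    feature_map: dict mapping (feature_tag, chunk_index) -> feature chunk index
    active_features: layout features listed in the font subset description
    """
    changed = True
    while changed:
        changed = False
        for feature in active_features:
            for chunk in list(chunk_set):
                extra = feature_map.get((feature, chunk))
                if extra is not None and extra not in chunk_set:
                    chunk_set.add(extra)
                    changed = True
    return chunk_set
```

Whether implementations should iterate to a fixed point like this, or do a single pass, is exactly the kind of detail the spec text needs to nail down.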

Overview.bs Outdated
===========================================================

<!-- TODO: remove obsolete tag once the separate range request spec is published -->
Range request incremental font transfer is specified in a separate document: [[RangeRequest obsolete]]
Binned Incremental Font Transfer is specified in a separate document: [[IFTB obsolete]]
Contributor:

[[IFTB]] won't work until we've committed the IFTB document. As a workaround you can instead use a regular old link <a href="IFTB.html">...</a> until the document is committed.

Overview.bs Outdated
static arrangement
of bins makes it more compatible with caching, including regional caching. ("More compatible" in the
sense that chunk files will see a higher cache hit rate compared with subset and patch files.) All IFTB
data is compressed at Brotli level 11 upfront. IFTB transfers all other tables
Contributor:

Brotli level 11 is an implementation detail of the current open-source Brotli encoder, not actually part of the Brotli standard. Instead, maybe just say maximum quality?

@@ -229,6 +255,10 @@ Opt-In Mechanism {#opt-in}

<em>This section is general to both IFT methods.</em>

(XXX Because IFTP is a protocol and IFTB is a format, I suspect most of this section and the related
technology questions are superfluous.)
Contributor:

Agreed. I think we'll end up with tech(incremental-patch) used with patch subset and format(new-iftb-format-name) for IFTB. Then there's no need for the incremental-auto/incremental-range mechanism and related text.

Contributor Author:

I agree but am leaving this as-is for now.

Offset32 CFFCharStringsOffset - 0 if glyf-based
Offset32 gidMapOffset
Offset32 chunkOffsetListOffset
Offset32 featureMapOffset
Contributor:

Here and elsewhere that Offset fields are used, be sure to mention what those offsets are relative to.

IFTB.bs Outdated
The chunk set is a bit array indicating whether the corresponding chunk is
present. The bits for chunks 0 through 7 are in chunkSet[0], those for 8
through 15 are in chunkSet[1], and so on. Within a byte the lowest chunk index
is represented by the 1s bit, then the 2s, then the 4s, and so on.
Contributor:

To be as unambiguous as possible, it's helpful to define the mapping to bits within a byte using the most and least significant bit (e.g. the least significant bit is chunk 0, the most significant bit is chunk 7).
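
Under that least-significant-bit-first convention, the membership test would amount to something like this (a sketch, assuming chunkSet is exposed as a byte array):

```python
def chunk_present(chunk_set: bytes, chunk_index: int) -> bool:
    """Bits for chunks 0-7 are in chunk_set[0], 8-15 in chunk_set[1], etc.;
    within a byte the lowest chunk index is the least significant bit."""
    return bool(chunk_set[chunk_index // 8] & (1 << (chunk_index % 8)))
```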

file. The string must contain substrings of "$1", "$2", "$3", "$4" and/or "$5", which must be replaced
with the corresponding hexadecimal digits of the chunk index ("$1" being the ones digit, "$2" being the
sixteens digit, and so on) to get the relative URI of the chunk. (This can then be combined with the
URL of the initial IFTB font file to produce the absolute URL of the chunk.)
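
A sketch of that expansion (the function and template names are illustrative only, and this sidesteps the encoding and escaping questions discussed below):

```python
def expand_chunk_uri(template: str, chunk_index: int) -> str:
    """Replace "$1".."$5" with the hex digits of the chunk index,
    "$1" being the ones digit, "$2" the sixteens digit, and so on."""
    digits = f"{chunk_index:05x}"  # five hex digits, most significant first
    for place in range(5):
        template = template.replace(f"${place + 1}", digits[4 - place])
    return template

# e.g. expand_chunk_uri("chunks/c$3$2$1.br", 0x2af) == "chunks/c2af.br"
```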
Contributor:

What is the encoding of the strings? (e.g. ASCII? UTF-8?)

Contributor Author:

I think we should adopt whatever currently makes sense from a w3c perspective. I believe URIs were ASCII for a long time but I don't know if that's still true. We should also be careful about the dollar signs.

Contributor:

https://datatracker.ietf.org/doc/html/rfc6570 is probably relevant here: a standardized way to have template parameters in a URL, and it also talks about encoding in section 1.6.

Contributor Author:

I looked through this. Given that IFTB would/will be a w3c spec maybe we should consult @svgeesus at a future meeting about the template format. The powers that be might prefer something more elegant than my dollar-sign expansion (although the spec allows for it).

For the time being I've been assuming the encoding will wind up being either ASCII or whatever corresponds to an "encoded" url.

Contributor:

It should be OK to reference an IETF RFC in this spec; we reference quite a few in the patch subset specification. You can add a reference like this: [[RFC6570]]. Agreed, we can discuss this further at the next meeting.

IFTB.bs Outdated
* The set of glyphs contained in the chunks loaded through the GID and feature maps must be a superset
of those in the GID closure of the font subset description.

The encoder has three options for addressing joint dependencies on individual glyphs:
Contributor:

I would clearly mark this part (the 3 options below) as being non-normative since it's just providing advice on how an encoder might be built and not actually laying out requirements that an encoder must follow.

Contributor:

Or, as another option, this could be reframed as things that the encoder may validly do. For example, it's valid to place a glyph in more than one chunk if needed.

process, it can be moved to bin 0 where it will always be included in the initially loaded file.

Glyph Bin Locality {#iftb-bin-locality}
---------------------------------------
Contributor:

This section is probably also non-normative.

@garretrieger (Contributor) commented Sep 5, 2023

  1. The initial file can be large (even after compression, I saw 200k-400k in the fonts that are part of the demo). Part of this is because all non-glyph data is transferred in full as part of the initial file. I wonder if it might make sense to try and have data from additional tables that are indexed by glyph id sent in the chunk files (for example vmtx/hmtx). See a breakdown of table sizes in the initial file here: https://docs.google.com/spreadsheets/d/1AK43VvY4LAEuttNPLoXulmedjRs8y44A4ZtN0lcT2QU/edit?resourcekey=0-uRS36zkMiyBmi8fG_U70ZA#gid=0

One additional thought regarding this: another approach that could be used to reduce the size of the initial file would be to split the font into a small number of subsets where possible and have each individual subset then augmented by IFTB (using unicode-range in CSS to select the subsets that are needed, as sketched below). For example, in the CJK case you could (if it wouldn't break layout rules) split the font into a high-usage and a low-usage subset. Then if a client doesn't need anything from the low-usage subset it would only download and augment the high-usage subset, thereby saving the transmission of all the layout and metric information for the low-usage glyphs. I don't think spec changes are needed to accommodate this, but it's probably worth mentioning in the section about optimizing the encoder.
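
A minimal sketch of that unicode-range arrangement (the file names, format identifier, and ranges are all made up for illustration):

```css
/* High-usage subset: fetched for common text, then augmented via IFTB. */
@font-face {
  font-family: "Noto Serif SC";
  src: url("noto-serif-sc.high.iftb") format("iftb"); /* hypothetical format name */
  unicode-range: U+4E00-62FF; /* illustrative split only */
}

/* Low-usage subset: only fetched if the page uses these code points. */
@font-face {
  font-family: "Noto Serif SC";
  src: url("noto-serif-sc.low.iftb") format("iftb");
  unicode-range: U+6300-9FFF;
}
```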

@skef (Contributor Author) commented Sep 7, 2023

Starting to work through some of these comments ...

It’s likely that the memory usage for the unpacked font will be quite high (eg. Noto Serif SC base file is ~1 mb uncompressed). This is partially because the format retains gids and ends up with lots of zero’d out regions for glyphs that aren’t yet present. I wonder if there are some ways that we can avoid doing this? I suspect browsers are going to be pretty concerned about overall memory usage.

After thinking about this question for a while I've arrived at some philosophy, or perhaps ideology, for myself: the role of IFT is to improve the network transfer of font data. Accordingly, when considering local storage size we should generally follow developments in general font technology, or influence such development, but not worry about or attempt unilateral improvements.

There are lots of potential ways of organizing a font file. If one were primarily concerned about persistent storage size, one could store the whole file compressed. If one were primarily concerned about in-memory size, one could store most of the per-glyph data together, perhaps on power-of-two boundaries, so that it could be easily paged in. If one were worried about both, one could do both: Store the per-glyph data in compressed, possibly page-aligned chunks, loading and decompressing only those chunks needed at a given time.

The general direction of font development in recent years has been the opposite. CFF had the advance in the glyph; glyf has phantom points. OpenType has moved much of this data into separate tables for better access during shaping. And although there is currently a component proposal to the ISO ad-hoc group for reducing file size, it chose to leave hmtx and vmtx as they were.

But IFTB is saving substantial amounts of space -- that of any un-merged glyph data. That it doesn't do so for other tables like hmtx and vmtx is pretty inherent to its GID-preservation-based design. Changing GIDs would (I think) require building knowledge of basically all the shaping tables back into the client side, making it more complicated than range-request would have been. And experimenting with run-length-encoded hmtx/vmtx, and therefore requiring shapers to be updated to support those formats, seems beyond the scope of our project.

I suppose the counter-argument would be that web fonts are more ephemeral on a given system than system fonts, so there is more reason to save space. But I'm not convinced on that basis.

@skef (Contributor Author) commented Sep 7, 2023

Related to this there’s currently a requirement to change the first 4 bytes of the version tag on the font file before passing the font to a rendering process. This may end up necessitating a full copy of the fonts data prior to passing the font on to the rendering process further increasing memory usage. We might consider relaxing this requirement if the rendering process is aware of the IFTB version tag and capable of correctly processing the file.

You're right, this shouldn't be a requirement, nor (as I advocated several months ago) should recalculating the checksums be a requirement.

How about we change the language to suggest that clients be enhanced to use the version directly and ignore the checksums, but indicate that in contexts where that is not possible/desirable the client implementation can make the necessary adjustments?
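
For clients that do rewrite the file, the operation under discussion is tiny in code but potentially expensive in memory, since it may force a private copy of the whole buffer. A sketch (both the 'IFTB' tag and the replacement sfnt version are assumptions for illustration, not confirmed values from the draft):

```python
def patch_version_tag(font_data: bytearray) -> None:
    """Overwrite a hypothetical 'IFTB' version tag with the standard
    TrueType sfnt version (0x00010000) so a non-IFTB-aware renderer
    will accept the file. Mutates in place; obtaining a mutable copy
    at all is where the extra memory cost comes from."""
    if font_data[0:4] == b"IFTB":  # hypothetical tag value
        font_data[0:4] = b"\x00\x01\x00\x00"
```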

@skef (Contributor Author) commented Sep 7, 2023

The initial file can be large (even after compression, I saw 200k-400k in the fonts that are part of the demo). Part of this is because all non-glyph data is transferred in full as part of the initial file. I wonder if it might make sense to try and have data from additional tables that are indexed by glyph id sent in the chunk files (for example vmtx/hmtx).

I am open to this discussion but we should think about the implications, not currently discussed in my added documentation (as far as I remember).

With IFT you have initial loads and augmentations. Each of these involves retrieving data you need to render the page correctly. There is an already well-established related question of what to do in the meantime. What many browsers do is render with fallback fonts, resulting in the oft-complained-about web font "flash" or "flicker".

If we continue to include the shaping data in IFTB, browsers will have more rendering options, particularly when it comes to augmentation, because they will be able to arrive at the final page layout once they have the base file. Backup glyphs might then be temporarily coerced into the metrics of the IFTB font and replaced when loaded.

Beyond that specific consideration, I'm not sure IFTB will ever be the right format if you're worried about 40k here or 80k there out of initially huge font files like the Noto fonts, just because you will almost inevitably be loading many more glyphs than you'll need for rendering, which will overwhelm those numbers. And looking at your spreadsheet, the plurality of the initial compressed font size is still in the CFF (or glyf/gvar for TTF) table, which I suspect is mostly the result of my crappy encoder. We should think of these files as having a much smaller initial percentage of glyphs.

@skef (Contributor Author) commented Sep 7, 2023

Font collections: when developing patch subset a concern was raised about making sure it supported font collections. So we’ll possibly need to think about what it would take to support font collections in IFTB.

Seems like a huge mess but we can talk about it.

@skef (Contributor Author) commented Sep 7, 2023

File extensions: would it make sense to have a recommended file extensions for the various new file formats (instead of reusing .otf and .woff2)?

".otf" seems easy enough to replace if we want to. I'm more reluctant to move away from ".woff2" for the compressed files for two reasons:

  1. WOFF2 already has a separate, internal version number, and the current IFTB use of WOFF2 is entirely spec-compliant.
  2. I worry that if we introduce a new extension for the compressed version, there will be years and years of servers pointlessly gzipping (or whatever) compressed IFTB files, followed by clients pointlessly un-gzipping them.

@skef (Contributor Author) commented Sep 7, 2023

The feature chunk mapping mechanism allows one to map from (feature, chunk idx) -> additional chunk idx. However, this is a bit limiting. If we had the ability to map from (feature, {chunk idx set}) -> additional chunk idx. Then it would be possible to conditionally include glyphs that are only needed when combinations of chunks are present (eg. ligature that can only be activated when glyphs from two different chunks are available). This would be optional, a simpler encoder could simply not utilize it (sticking with the simpler (feature, chunk idx) approach). An advanced encoder could likely utilize this to produce more granular chunks when encountering multi glyph substitution rules.

For "traditional" ligature scenarios I'm not sure I'm convinced of the benefit. Are there cases where you'll really be able to group enough related ligatures together to warrant an independent chunk? Or are you thinking it would be worth having single-glyph chunks just for this case?

Maybe looking at some specific ligature scenarios would help convince me.

OTOH I can see how something like this might be more directly relevant to emoji fonts. I haven't dug into emoji specifics very much in thinking about IFTB.
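
For concreteness, here is roughly what the extended mapping proposed above would look like next to the current one (purely illustrative data structures, not anything from the draft):

```python
# Current draft: (feature, chunk index) -> additional chunk index.
simple_map = {("liga", 12): 40}

# Proposed extension: (feature, set of chunk indices) -> additional chunk index.
# The extra chunk is needed only when *all* chunks in the key set are loaded,
# e.g. a ligature whose component glyphs live in chunks 12 and 17.
combo_map = {("liga", frozenset({12, 17})): 41}

def extra_chunks(loaded, features, combo_map):
    """Chunks triggered by combinations of already-loaded chunks."""
    return {extra for (feature, needed), extra in combo_map.items()
            if feature in features and needed <= loaded}

# extra_chunks({12, 17}, {"liga"}, combo_map) == {41}
```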

@skef (Contributor Author) commented Sep 7, 2023

I made some updates
