examples: add `shaped-text2svg` for generating SVGs from shaped Unicode text. #70

eddyb · 2023-06-22T18:19:00Z

This is based on the ttf-parser example font2svg, and uses a combination of unicode-bidi and rustybuzz on top of it, to offer a relatively compact but (hopefully) complete usage example for rustybuzz.
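For readers unfamiliar with what sits between shaping and SVG output, here is a minimal std-only sketch of the pen-advance step. `ShapedGlyph` and `place_glyphs` are hypothetical names for illustration, not part of rustybuzz or of the example in this PR; the fields only mirror the advance/offset values that HarfBuzz-style shapers report per glyph, and the sketch assumes they are in font units.

```rust
/// Hypothetical stand-in for one shaped glyph (not a rustybuzz type).
/// All values are assumed to be in font units.
#[derive(Clone, Copy, Debug, PartialEq)]
pub struct ShapedGlyph {
    pub glyph_id: u32,
    pub x_advance: i32,
    pub y_advance: i32,
    pub x_offset: i32,
    pub y_offset: i32,
}

/// Returns the origin of every glyph plus the final pen position.
///
/// Each glyph is drawn at `pen + offset`; only the advance moves the pen,
/// so offsets (e.g. for combining marks) never affect later glyphs.
pub fn place_glyphs(glyphs: &[ShapedGlyph]) -> (Vec<(i32, i32)>, (i32, i32)) {
    let (mut pen_x, mut pen_y) = (0i32, 0i32);
    let mut origins = Vec::with_capacity(glyphs.len());
    for g in glyphs {
        origins.push((pen_x + g.x_offset, pen_y + g.y_offset));
        pen_x += g.x_advance;
        pen_y += g.y_advance;
    }
    (origins, (pen_x, pen_y))
}
```

The origins are still in the font's coordinate system (y pointing up), so an SVG writer would additionally need to scale them and flip the y axis.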

While discussing such a self-contained "complete example" with @Manishearth, he mentioned that it may be possible for rustybuzz to offer a "complete Unicode bidirectional shaping solution", to avoid having the user correctly use unicode-bidi etc.

Using Go Noto Universal 6.0's GoNotoCurrent.ttf, and the UDHR, I was able to get some examples:
(all images are chosen samples, with links above them for the full original version, due to GitHub limitations)

| Lang | `shaped-text2svg` output | diff w/ browser rendering |
|---|---|---|
| `eng` | full SVG | full HTML (`---` are misaligned - all languages hit this) |
| `arb` | full SVG | full HTML (Latin glyphs appear to misalign Arabic ones) |
| `hin` | full SVG | full HTML (no idea what's going on here, more investigation needed) |
| `cmn_hans` | full SVG | full HTML (`(III)` confirmed to shape differently in browser vs rustybuzz) |

A few notes about the diff in the last column:

- I haven't published the script I'm using because it's frankly a mess and less automated than I'd like, but I suspect some people might want it, maybe even integrated into the example itself (or at least available somewhere)
- I'm overlapping the SVG and HTML text 1:1 and using CSS `mix-blend-mode: difference;`
  - it's not perfect because of what I assume are anti-aliasing/fine-positioning differences between the SVG paths and the HTML text, but it's close enough that you only see the outline (i.e. where grayscale anti-aliasing is used, not the fill of the glyphs) when shaping matches "perfectly"
- something weird is going on with this font and its browser rendering of e.g. `(III)`, compared to rustybuzz
  - `(III)` has all 5 glyphs aligned at the top in the browser, but vertically centered in rustybuzz
  - I think a lot of the mismatches are just oddities like that, entirely confined to ASCII/Latin, which then cause the rest of the non-ASCII/Latin text to be misaligned
  - hopefully this is just me misusing rustybuzz and/or ttf-parser APIs, but at this point I'm not sure
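To make the diffing idea concrete: `mix-blend-mode: difference` yields the per-channel absolute difference of the two layers, so identical pixels come out black. Below is a rough std-only sketch of the same comparison on two 8-bit grayscale buffers. This is not the unpublished script mentioned above; `difference`, `mismatch_ratio`, and the threshold are hypothetical, and rasterizing the SVG and HTML into such buffers is assumed to happen elsewhere.

```rust
/// Per-pixel absolute difference of two same-sized 8-bit grayscale images.
/// Returns `None` when the buffers differ in length.
pub fn difference(a: &[u8], b: &[u8]) -> Option<Vec<u8>> {
    if a.len() != b.len() {
        return None;
    }
    Some(a.iter().zip(b).map(|(&x, &y)| x.abs_diff(y)).collect())
}

/// Fraction of pixels whose difference exceeds `threshold`.
/// A small threshold tolerates anti-aliasing noise along glyph outlines.
pub fn mismatch_ratio(diff: &[u8], threshold: u8) -> f64 {
    if diff.is_empty() {
        return 0.0;
    }
    let mismatched = diff.iter().filter(|&&d| d > threshold).count();
    mismatched as f64 / diff.len() as f64
}
```

A ratio near zero would correspond to the "only the outline is visible" case, while filled glyph shapes in the difference image push the ratio up.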

TODO: try more languages, maybe emoji (hard to mix emoji & non-emoji w/o font fallback), try to improve diffing against browser rendering


eddyb · 2023-06-23T05:00:35Z

Update: I've narrowed down most of the weird differences caused by ASCII to `locl` - some differences go away if I set `font-feature-settings: "locl" 0;` in the browser and likewise disable `locl` in rustybuzz.

Another way to control this is with the `lang` property in the browser: if I do `document.body.lang = "zh"` on the `cmn_hans` example, all the differences in the bulk of the text go away, and new differences appear in the English header at the top.

At this point I would have to port this example to use harfbuzz to be able to tell, but I suspect the default of leaving the language unset is simply different from what browsers do (which may be using additional heuristics?).

EDIT: given that I see no changes when I force en on either side, I think that's quite literally the default (or equivalent to it in whatever OpenType terms) and there's a behavior mismatch within it, without browsers doing anything more sophisticated.

RazrFalcon · 2023-06-23T08:38:47Z

Oh wow, thanks! Wasn't expecting someone to dive into this. I was planning to write something like this myself, but didn't have time.

I'm not sure we need full browser compatibility in this demo/example. Even resvg has a far simpler implementation. And it's the reason rustybuzz exists.

As for language and bidi - harfbuzz/rustybuzz are pretty low-level libraries. You cannot use them directly. You do need a text layout library on top of them. Like pango on Linux.

Honestly, I'm not even sure we need bidi in this example. Either way, it's good enough for me already. And if you want to improve it a bit, I do not mind. But we should not try implementing a text layout library in a simple example.

Manishearth · 2023-06-23T09:11:21Z

I would recommend having bidi in the example because it's a useful illustration of all the parts needed to handle text right, and prevents people from using the library naively.

(And because bidi is weird and complicated and the integration of a bidi algorithm implementation with a shaping engine is not necessarily immediately obvious)
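As an illustration of one of the non-obvious parts, here is a minimal std-only sketch of the reordering step (rule L2 of the Unicode Bidirectional Algorithm, UAX #9). It assumes the embedding levels have already been resolved by a bidi implementation such as unicode-bidi, and that each level run is shaped separately with its own direction; `visual_order` is a hypothetical helper, not an API of unicode-bidi or rustybuzz.

```rust
/// Given one resolved embedding level per run (in logical order), returns
/// the indices of the runs in visual, left-to-right order.
///
/// UAX #9 rule L2: from the highest level down to the lowest odd level,
/// reverse every contiguous sequence of runs at that level or higher.
pub fn visual_order(levels: &[u8]) -> Vec<usize> {
    let mut order: Vec<usize> = (0..levels.len()).collect();
    let max = match levels.iter().copied().max() {
        Some(m) => m,
        None => return order, // no runs at all
    };
    let min_odd = match levels.iter().copied().filter(|l| l % 2 == 1).min() {
        Some(m) => m,
        None => return order, // purely left-to-right: nothing to reorder
    };
    let mut level = max;
    while level >= min_odd {
        let mut i = 0;
        while i < order.len() {
            if levels[order[i]] >= level {
                let start = i;
                while i < order.len() && levels[order[i]] >= level {
                    i += 1;
                }
                order[start..i].reverse();
            } else {
                i += 1;
            }
        }
        level -= 1; // cannot underflow: min_odd is at least 1
    }
    order
}
```

For example, levels `[0, 1, 2, 2, 1, 0]` (left-to-right paragraph, a right-to-left phrase with an embedded number) give the visual order `[0, 4, 2, 3, 1, 5]`: the right-to-left runs swap places while the number in the middle keeps its left-to-right order.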

RazrFalcon · 2023-06-23T09:17:43Z

@Manishearth Depending on your definition of text layout, one can have thousands of lines of code on top of rustybuzz.
Sure, I don't really mind having bidi in this example, but it's still pretty far from a proper text layout.

I do have plans on writing an easy to use text layout/rustybuzz wrapper eventually, but time is not on my side.

> prevents people from using the library naively

Meanwhile I keep telling people to stop using rustybuzz... In the sense that it must not be used directly. You do need a higher-level wrapper for it.

ebraminio · 2024-08-19T16:51:57Z

This, especially the bidi integration, is very useful, thank you @eddyb. I wish you could integrate the COLRv1 support from newer versions of ttf-parser's font2svg (https://github.com/RazrFalcon/ttf-parser/blob/master/examples/font2svg.rs), and maybe even release it as a separate crate if possible. A while ago I wanted to build a font-rendering web service for an open source website (the proposal there didn't go that well), and in particular I didn't know what to do about the bidi part, as all I did was copy font2svg and wrap it in a service (https://github.com/ebraminio/fontoid/blob/main/src/main.rs). Taking one font as input, handling bidi and COLRv1, and outputting SVG was all I wanted, and you've already done that. I wish you could turn it into something better, either in this repo or somewhere else. Thanks

examples: add shaped-text2svg for generating SVGs from shaped Unicode text. (ce63ed7)

eddyb mentioned this pull request Jun 27, 2023

Replace makepad_vector::ttf_parser with ttf-parser crate. makepad/makepad#188

Merged

eddyb mentioned this pull request Aug 3, 2023

draw: integrate rustybuzz for shaping text. makepad/makepad#195

Merged

RazrFalcon mentioned this pull request Jul 12, 2024

Create equivalents for hb-view and hb-shape #130

Closed

ebraminio mentioned this pull request Aug 19, 2024

Integrate bidi ebraminio/fontoid#1

Open
