-
-
Notifications
You must be signed in to change notification settings - Fork 1.3k
Description
Manish Goregaokar explains this better than I ever will: https://manishearth.github.io/blog/2017/01/14/stop-ascribing-meaning-to-unicode-code-points/. The issues described in his blog post also show up in kitty. For example, ZWJ's are rendered as <200d> on the prompt and ignored in the output. This is what echo "👨👩👧👦" looks like on the prompt:
$ echo "👨<200d>👩<200d>👧<200d>👦"
👨👩👧👦
On the other hand 👩🏽 renders correctly but takes up 2 cell blocks (both in the prompt and the output) so this is consistent with #461 (comment).
For text in non-latin scripts there are also some weird things happening: 각 shows up in the prompt as ᄀ<1161><11a8> but correctly in the output (and it only takes up one cell, even though it is made up of three codepoints, so this is not consistent with #461 (comment)).
All in all this is probably a (really) hard problem to solve, but it would be greatly appreciated if you could look into it.