don't manually grapheme align ts highlights #10310

pascalkuthe · 2024-04-08T21:34:21Z

I have known the root cause of #6645 for a while: the C grammar has a bug where it sometimes creates non-grapheme-aligned highlight spans when dealing with Unicode characters like emojis.

This caused crashes because our grapheme alignment code couldn't deal with offsets that are not char-aligned. I think refusing such offsets is the right (and potentially the only valid) choice when doing grapheme alignment.
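To make "non-char aligned offset" concrete, here is a minimal sketch using only the Rust standard library on a plain `&str`. It is not the Helix code, which works on ropes; the helper name `floor_to_char_boundary` is hypothetical. It shows that a byte offset can land inside a multi-byte character, and one way such an offset could be rounded down to a valid boundary.

```rust
/// Hypothetical helper (not part of Helix): round `byte_idx` down to the
/// nearest char boundary of `text`, clamping to the end of the string.
pub fn floor_to_char_boundary(text: &str, mut byte_idx: usize) -> usize {
    if byte_idx >= text.len() {
        return text.len();
    }
    // `is_char_boundary(0)` is always true, so this loop terminates.
    while !text.is_char_boundary(byte_idx) {
        byte_idx -= 1;
    }
    byte_idx
}
```

In `"a😀b"` the emoji occupies bytes 1 to 4, so byte offset 3 is not a char boundary. Slicing the string at that offset would panic, while the helper rounds it down to 1.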

In our case it was unfortunate as it led to crashes. I also wanted to get rid of this grapheme alignment anyway because we already render the text one grapheme at a time, so doing it is really unnecessary. In the past, however, it wasn't possible to remove the alignment because the additional highlight iterators merged on top of the base iterator were using char indexing and grapheme-aligned indices. Just switching the TS iterator would have caused highlighting bugs.

However, the overlay and base highlights were recently decoupled, so the base highlights are now separate and it became possible to convert the base iterator to byte indexing instead.
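The idea of consuming byte-indexed spans while rendering one unit at a time can be sketched as follows. This is an illustration under stated assumptions, not the code in this PR: `Span` and `style_units` are hypothetical names, `char` stands in for a grapheme cluster (the standard library has no grapheme segmentation), and the spans are assumed to be sorted and non-overlapping.

```rust
use std::ops::Range;

/// Hypothetical highlight span: a byte range plus an opaque style id.
pub struct Span {
    pub bytes: Range<usize>,
    pub style: u8,
}

/// Walk the text one unit at a time and pick the style for each unit from
/// the byte offset at which that unit starts. `spans` must be sorted and
/// non-overlapping. `char` is a stand-in for a grapheme cluster.
pub fn style_units(text: &str, spans: &[Span]) -> Vec<(char, Option<u8>)> {
    let mut out = Vec::new();
    let mut next = 0; // index of the first span that has not ended yet
    for (pos, ch) in text.char_indices() {
        // Skip spans that end at or before this unit's first byte.
        while next < spans.len() && spans[next].bytes.end <= pos {
            next += 1;
        }
        // The span applies only if it has already started at that byte.
        let style = spans
            .get(next)
            .filter(|s| s.bytes.start <= pos)
            .map(|s| s.style);
        out.push((ch, style));
    }
    out
}
```

Because span boundaries are only ever compared against the byte offsets of whole units, a span that starts in the middle of a multi-byte character needs no alignment step: in this sketch it simply takes effect at the next unit.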

helix-term/src/ui/document.rs

helix-term/src/ui/editor.rs

helix-term/src/ui/document.rs

helix-stdx/src/rope.rs

helix-term/src/ui/document.rs

pascalkuthe added the C-bug, E-easy, A-helix-term, S-waiting-on-review, and C-perf labels Apr 8, 2024

archseer reviewed Apr 9, 2024

helix-term/src/ui/document.rs (outdated, resolved)

helix-term/src/ui/document.rs (outdated, resolved)

helix-term/src/ui/editor.rs (resolved)

helix-term/src/ui/document.rs (resolved)

pascalkuthe force-pushed the non_unicode_crash branch from 28f6e30 to 88f456a Compare April 9, 2024 10:00

the-mikedavis reviewed Apr 9, 2024

helix-stdx/src/rope.rs (outdated, resolved)

helix-term/src/ui/document.rs (outdated, resolved)

helix-term/src/ui/document.rs (outdated, resolved)

don't manually grapheme align ts highlights

d31c09f

pascalkuthe force-pushed the non_unicode_crash branch from bb63ee9 to d31c09f Compare April 9, 2024 21:14

the-mikedavis approved these changes Apr 9, 2024

archseer merged commit 73d26d0 into master Apr 10, 2024
6 checks passed

archseer deleted the non_unicode_crash branch April 10, 2024 15:14

postsolar pushed a commit to postsolar/helix that referenced this pull request Apr 20, 2024

don't manually grapheme align ts highlights (helix-editor#10310)

63797cc
