Allow `tokenizer` to pause and resume in the middle of a line #1444

Guldoman · 2023-03-27T19:08:40Z

For now the pause policy is that every 200 tokenized chars, if half a frametime has passed, it "yields".
We might consider allowing the customization of the pause policy.

The yielding behavior is not implemented with coroutines because of performance concerns around creating them. I might test if this is a valid concern.

Only tokenization coroutines are allowed to tokenize whole lines.
Highlighter:get_line only tokenizes the first 200 characters, plus anything else it can in the allowed time frame.
This is done because that function can be called multiple times per frame, which could tank the performance. We might consider adding a parameter to allow resuming the tokenization.

When the tokenizer pauses, the remaining text uses the incomplete token type. Suggestions for a different name are welcome.

We sadly can't resume the tokenization of a line from where it changed, and need to restart the tokenization from scratch.
This is because the change itself could match with a rule that applies before it.

Testing is very welcome.

adamharrison · 2023-04-01T15:47:56Z

For now the pause policy is that every 200 tokenized chars, if half a frametime has passed, it "yields".
We might consider allowing the customization of the pause policy.

I think it's fine; very few people are going to customize this. If anyone wants to, they can complain, and we can put in a config variable, but I really don't think it's necessary.

The yielding behavior is not implemented with coroutines because of performance concerns around creating them. I might test if this is a valid concern.

I honestly think it's fine; it's clear enough, and not a huge amount of code.

Only tokenization coroutines are allowed to tokenize whole lines.
Highlighter:get_line only tokenizes the first 200 characters, plus anything else it can in the allowed time frame.
This is done because that function can be called multiple times per frame, which could tank the performance. We might consider adding a parameter to allow resuming the tokenization.

Seems reasonable.

When the tokenizer pauses, the remaining text uses the incomplete token type. Suggestions for a different name are welcome.

I'm good with this.

We sadly can't resume the tokenization of a line from where it changed, and need to restart the tokenization from scratch. This is because the change itself could match with a rule that applies before it.

Honestly, that's fine, and reasonable for now. If it becomes an issue with people who really want to edit really long lines, while tokenizing them, we can always revisit.

I'm going to do a bit more testing, and if it looks good, I'll merge.

adamharrison · 2023-04-01T16:12:36Z

Yeah, tested; looks good. Merging.

…l#1444)

github-actions bot added the Category: Lua Core label Mar 27, 2023

Guldoman linked an issue Mar 27, 2023 that may be closed by this pull request

Lite XL really hates really long lines #1369

Closed

Guldoman force-pushed the PR_yieldable_tokenizer branch 3 times, most recently from d54c3f5 to a1035c8 Compare March 27, 2023 23:20

Allow tokenizer to pause and resume in the middle of a line

4ff817e

Guldoman force-pushed the PR_yieldable_tokenizer branch from a1035c8 to 4ff817e Compare March 29, 2023 00:04

adamharrison merged commit 86fa90a into lite-xl:master Apr 1, 2023
8 checks passed

takase1121 pushed a commit to takase1121/lite-xl that referenced this pull request Aug 19, 2023

Allow tokenizer to pause and resume in the middle of a line (lite-x…

6d21720

…l#1444)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow `tokenizer` to pause and resume in the middle of a line #1444

Allow `tokenizer` to pause and resume in the middle of a line #1444

Guldoman commented Mar 27, 2023

adamharrison commented Apr 1, 2023

adamharrison commented Apr 1, 2023

Allow tokenizer to pause and resume in the middle of a line #1444

Allow tokenizer to pause and resume in the middle of a line #1444

Conversation

Guldoman commented Mar 27, 2023

adamharrison commented Apr 1, 2023

adamharrison commented Apr 1, 2023

Allow `tokenizer` to pause and resume in the middle of a line #1444

Allow `tokenizer` to pause and resume in the middle of a line #1444