refactor(treesitter): rely more on ts correctness #18109

vigoux · 2022-04-14T12:24:01Z

@bfredl, this removes an old part of the code that was added a long
time ago.

Now comes the question of actually using ephemeral extmarks, and
rather use persistent extmarks and rely on tree updates.

I just leave that there...

bfredl · 2022-04-14T12:58:19Z

Why remove the state between lines? That seems like a pure regression from a performance perspective (also it just is wrong, we redraw a line range, not a line at a time).

vigoux · 2022-04-14T13:02:07Z

Why remove the state between lines? That seems like a pure regression from a performance perspective (also it just is wrong, we redraw a line range, not a line at a time).

Because, as most of my fellow developer, I don't have a terrifically deep understanding of the drawing loop.

Why not sending the end line in the callback to the redrawing ?

vigoux · 2022-04-14T13:03:15Z

And what about using static extmarks instead of a complex highlight state ? That will make a huge part of the code simpler to understand.

bfredl · 2022-04-14T13:06:15Z

Because, as most of my fellow developer, I don't have a terrifically deep understanding of the drawing loop.

Any perf related change must be based on "understanding", other wise it is just a random hit or miss. Profiling can help to get empirical understanding (I have used this to get rid of bottlenecks in unexpected places, such as strequal() ).

Why not sending the end line in the callback to the redrawing ?

That's on_win, and could be alternative, but then the hlstate array might get much longer (at the point we could be starting to see O(n^2) behavior there, but likely only for real big windows).

vigoux · 2022-04-14T13:08:10Z

Any perf related change must be based on "understanding", other wise it is just a random hit or miss. Profiling can help to get empirical data.

Running the test showed a slight improvement in performances.

If you don't want the changes, just close that PR, I don't care, I just wanted to raise a point (I did twice) about static extmarks, and you plain ignored it twice.

bfredl · 2022-04-14T13:10:33Z

If you don't want the changes, just close that PR, I don't care, I just wanted to raise a point (I did twice) about static extmarks, and you plain ignored it twice.

simply because I needed to think a bit more about that? you posted it 37 minutes ago, and I don't see the hurry, sorry. it could be simpler, though I like the current code that it doesn't duplicate state already in the tree in a second static copy, so I think that is a point for simplicity as well.

vigoux · 2022-04-14T13:13:02Z

I think that is a point for simplicity as well.

That is a valid point. Although I think that this is an instance where this simplicity might lead to a bit too much work on the drawing side.

I think we'd need real data in order to compare this in real life. I remember I did a similar change a while back, and that lead to significant performance improvements.

bfredl · 2022-04-14T13:15:40Z

That is a valid point. Although I think that this is an instance where this simplicity might lead to a bit too much work on the drawing side.

Sure but OTOH if you pre-render whenever the tree is changed, you do more work upfront instead (which might not end up being used). I suspect the trade-off varies greatly how much text-editing vs scrolling around the user does.

vigoux · 2022-04-14T13:16:41Z

Good point, maybe we need something that redraws the static extmarks in on_win ?

vigoux · 2022-04-15T13:09:54Z

@bfredl coming back at this, I am able, with this version of the implementation to limit how far we go on the number of columns.

An even better way of doing this (that I don't know though), would be to pass the very specific byte offsets to the on_line callback, so that we only draw the very portion of the line that the user sees.

vigoux · 2022-04-15T13:11:57Z

Thus this version should also fix #14852

vigoux · 2022-05-02T11:38:04Z

Any update on this PR ? I think it is already a step forward in fixing bugs...

vigoux · 2022-05-03T08:26:37Z

I have added the doubling of the match limit value, which should give more room for queries to run.
I think that the PR is ready this way. @bfredl if you want to give a final opinion, I'd be glad to have it.

vigoux · 2022-05-03T08:27:51Z

For the record, this fixes:
nvim-treesitter/nvim-treesitter#1788
nvim-treesitter/nvim-treesitter#1506

bfredl · 2022-05-03T08:47:13Z

Now that we have untangled the correctness from perf aspect, perhaps we could break out the former to a separate PR? I e more or less the changes to treesitter.c (including the free list, as that is more or less obvious)

clason · 2022-05-03T08:58:32Z

That is a good idea, if only because the fix should definitely be backported to 0.7.

vigoux · 2022-05-03T09:39:49Z

I am in favor of this, what parts do you want to separate:

free list + synmaxcol
iterator reset + doubling?

This way we have the fix on one hand a'd the feature on the other?

clason · 2022-05-03T09:44:33Z

I'd say put the kväck and the doubling in a separate fix(treesitter) PR.

lewis6991 · 2022-06-16T09:08:35Z

For reference, the testcase from lewis6991/gitsigns.nvim#575 is pretty simple:

nvim +'set foldmethod=manual' +'1000,5000fold' +'999' ./src/nvim/eval.c

Move cursor up and down

vigoux · 2022-06-16T09:11:21Z

Reproducible, I am going to attempt a little benchmark somehow.

vigoux · 2022-06-16T09:44:20Z

I am somehow not able to reproduce this using nvim --clean and that sounds weird to me.

lewis6991 · 2022-06-16T09:50:28Z

Try making the fold as large as possible.

vigoux · 2022-06-16T09:53:41Z

I finally got a benchmark, as follows:

VIMRUNTIME=$(pwd)/runtime/ time nvim --clean -u init.lua ./src/nvim/eval.c

-- init.lua
local parser = vim.treesitter.get_parser(0, "c", {})
local highlighter = vim.treesitter.highlighter.new(parser)

local function keys(keys)
  vim.api.nvim_feedkeys(keys, 't', true)
end

vim.opt.foldmethod = "manual"
vim.opt.lazyredraw = false

vim.cmd [[1000,7000fold]]
vim.cmd [[999]]

local function mk_keys(n)
  local acc = ""
  for i = 1, n do
    acc = acc .. "j"
  end
  for i = 1, n do
    acc = acc .. "k"
  end

  return "qq" .. acc .. "q"
end

keys(mk_keys(10))

for i = 1, 100 do
  keys "@q"
  vim.cmd[[redraw!]]
end

vim.cmd [[quit!]]

With this PR : 0:01.10
Current master: 0:03.11

That's a 3x improvement in performance.

bfredl · 2022-06-16T10:11:17Z

I suppose the presence of a (large) fold could be detected by comparing difference of line param and previous value with some threshold?

vigoux · 2022-06-16T10:14:56Z

What's a "large enough" fold ? I think that the data above is a clear indicator of a perf improvement.
I am working on more benchmarks, in order to bring more proofs of the perf improvement.

bfredl · 2022-06-16T11:28:27Z

idk, as a simple starting point we can just do it for any fold (so whenever delta > 1)

lewis6991 · 2022-06-16T14:36:53Z

Here's my results using the benchmark above.

	Time
master	`5.168`
This PR	`1.528`
#18760	`0.247`

So 3x improvement with this PR, but 20x with #18760!!!

So there's some evidence that re-using the iterator is worth doing.

clason · 2022-06-16T14:52:12Z

We really need a dedicated benchmark test suite (make performancetest), not necessarily for CI, but both for such performance PRs and for regression testing with git bisect.

A PR or dedicated issue for discussing this welcome!

lewis6991 · 2022-06-16T15:08:38Z

Ding #18989

add benchmark from #18109

add benchmark from neovim#18109

github-actions bot added lua stdlib treesitter refactor changes that are not features or bugfixes labels Apr 14, 2022

github-actions bot requested a review from bfredl April 14, 2022 12:24

vigoux force-pushed the update_highlighter branch from 9535cec to bce8bcf Compare April 14, 2022 12:35

vigoux changed the title ~~refactor(ts): rely more on ts correctness~~ refactor(treesitter): rely more on ts correctness Apr 14, 2022

clason added this to the 0.8 milestone Apr 14, 2022

vigoux force-pushed the update_highlighter branch from d29e373 to 0edc5f9 Compare April 15, 2022 13:08

vigoux force-pushed the update_highlighter branch from 0edc5f9 to d28086f Compare May 2, 2022 11:41

vigoux mentioned this pull request May 3, 2022

HTML/svelte syntax highligh disappears when scrolling a large file nvim-treesitter/nvim-treesitter#1788

Closed

3 tasks

vigoux force-pushed the update_highlighter branch 2 times, most recently from 06a0a01 to 2fecc69 Compare May 3, 2022 08:26

perf(treesitter): remove unnecessary check

2be824d

vigoux force-pushed the update_highlighter branch from 55a3706 to 2be824d Compare June 16, 2022 09:08

akinsho mentioned this pull request Jun 16, 2022

[WIP] Better Neorg Performance nvim-neorg/neorg#473

Closed

lewis6991 added a commit to lewis6991/neovim that referenced this pull request Jun 16, 2022

test(treesitter): add benchmark from neovim#18109

48bb09d

lewis6991 added a commit to lewis6991/neovim that referenced this pull request Jun 16, 2022

test(treesitter): add benchmark from neovim#18109

48781fe

lewis6991 added a commit to lewis6991/neovim that referenced this pull request Jun 16, 2022

test(treesitter): add benchmark from neovim#18109

cddfa8c

justinmk pushed a commit that referenced this pull request Jun 17, 2022

test(treesitter): add benchmark #18989

e0aa1d8

add benchmark from #18109

This comment was marked as off-topic.

Sign in to view

justinmk mentioned this pull request Jun 19, 2022

lagging cursor movement in big json with tree-sitter #14852

Closed

kraftwerk28 pushed a commit to kraftwerk28/neovim that referenced this pull request Jul 6, 2022

test(treesitter): add benchmark neovim#18989

ee68b60

add benchmark from neovim#18109

marcelbeumer mentioned this pull request Jul 16, 2022

[Go] performance degrades (input delay) when file gets bigger nvim-treesitter/nvim-treesitter#3187

Closed

smjonas pushed a commit to smjonas/neovim that referenced this pull request Dec 31, 2022

test(treesitter): add benchmark neovim#18989

4a97150

add benchmark from neovim#18109

zeertzjq removed the lua stdlib label Mar 22, 2023

clason modified the milestones: 0.9, 0.10 Mar 31, 2023

lewis6991 mentioned this pull request Sep 16, 2023

perf(treesitter): do not scan past given line for predicate match #25188

Merged

dundargoc modified the milestones: 0.10, backlog Mar 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor(treesitter): rely more on ts correctness #18109

refactor(treesitter): rely more on ts correctness #18109

vigoux commented Apr 14, 2022

bfredl commented Apr 14, 2022

vigoux commented Apr 14, 2022

vigoux commented Apr 14, 2022

bfredl commented Apr 14, 2022 •

edited

Loading

vigoux commented Apr 14, 2022

bfredl commented Apr 14, 2022

vigoux commented Apr 14, 2022

bfredl commented Apr 14, 2022

vigoux commented Apr 14, 2022

vigoux commented Apr 15, 2022

vigoux commented Apr 15, 2022

vigoux commented May 2, 2022

vigoux commented May 3, 2022

vigoux commented May 3, 2022

bfredl commented May 3, 2022

clason commented May 3, 2022

vigoux commented May 3, 2022

clason commented May 3, 2022

lewis6991 commented Jun 16, 2022 •

edited

Loading

vigoux commented Jun 16, 2022

vigoux commented Jun 16, 2022

lewis6991 commented Jun 16, 2022

vigoux commented Jun 16, 2022 •

edited

Loading

bfredl commented Jun 16, 2022

vigoux commented Jun 16, 2022

bfredl commented Jun 16, 2022

lewis6991 commented Jun 16, 2022 •

edited

Loading

clason commented Jun 16, 2022

lewis6991 commented Jun 16, 2022

This comment was marked as off-topic.

This comment was marked as off-topic.

refactor(treesitter): rely more on ts correctness #18109

Are you sure you want to change the base?

refactor(treesitter): rely more on ts correctness #18109

Conversation

vigoux commented Apr 14, 2022

bfredl commented Apr 14, 2022

vigoux commented Apr 14, 2022

vigoux commented Apr 14, 2022

bfredl commented Apr 14, 2022 • edited Loading

vigoux commented Apr 14, 2022

bfredl commented Apr 14, 2022

vigoux commented Apr 14, 2022

bfredl commented Apr 14, 2022

vigoux commented Apr 14, 2022

vigoux commented Apr 15, 2022

vigoux commented Apr 15, 2022

vigoux commented May 2, 2022

vigoux commented May 3, 2022

vigoux commented May 3, 2022

bfredl commented May 3, 2022

clason commented May 3, 2022

vigoux commented May 3, 2022

clason commented May 3, 2022

lewis6991 commented Jun 16, 2022 • edited Loading

vigoux commented Jun 16, 2022

vigoux commented Jun 16, 2022

lewis6991 commented Jun 16, 2022

vigoux commented Jun 16, 2022 • edited Loading

bfredl commented Jun 16, 2022

vigoux commented Jun 16, 2022

bfredl commented Jun 16, 2022

lewis6991 commented Jun 16, 2022 • edited Loading

clason commented Jun 16, 2022

lewis6991 commented Jun 16, 2022

This comment was marked as off-topic.

This comment was marked as off-topic.

bfredl commented Apr 14, 2022 •

edited

Loading

lewis6991 commented Jun 16, 2022 •

edited

Loading

vigoux commented Jun 16, 2022 •

edited

Loading

lewis6991 commented Jun 16, 2022 •

edited

Loading