Move semantic tokens to LSP implementation #1806

0dinD · 2021-06-24T21:37:53Z

Fixes #1678
Fixes redhat-developer/vscode-java#1999

This PR moves the semantic tokens implementation from a custom command to the proper LSP 3.16 implementation.

Perhaps someone could update the LSP support wiki page once this PR is merged? Currently, we only support textDocument/semanticTokens/full, not textDocument/semanticTokens/full/range or textDocument/semanticTokens/delta.

Eskibear

Good job, we can remove extra delegate commands now.

I'll try it later and below are some immediate thoughts.

A monitor should be passed into AST provider, for cancellation and better progress indication.
Some changes look unrelevant to semantic tokens. I suggest to create a separate PR if you think they are buggy.
LSP4J is using wrapper objects instead of primitive types. I remember in our previous discussion we switched to use primitives for performance concern. We should double confirm whether there's any performance degradation.

org.eclipse.jdt.ls.core/src/org/eclipse/jdt/ls/core/internal/handlers/JDTLanguageServer.java

...eclipse.jdt.ls.core/src/org/eclipse/jdt/ls/core/internal/handlers/SemanticTokensHandler.java

Eskibear · 2021-07-15T03:04:21Z

ping~
@0dinD will you update this PR accordingly?

0dinD · 2021-07-15T15:16:16Z

Yes, I plan to look at this again in a few days. I've been quite busy recently so that's why I haven't responded yet.

0dinD · 2021-07-18T11:20:58Z

A monitor should be passed into AST provider

Thanks, good catch!

Some changes look unrelevant to semantic tokens. I suggest to create a separate PR if you think they are buggy.

Those changes don't affect the code, I was just trying to get rid of this warning:

If you want me to revert the change that's fine, but I'm not going to make another PR for it. I usually take the opportunity to fix small things like this in the files that I'm changing for a PR.

LSP4J is using wrapper objects instead of primitive types. I remember in our previous discussion we switched to use primitives for performance concern. We should double confirm whether there's any performance degradation.

I switched to primitives mainly because it seemed like the right thing to do while I was at it, but the main performance improvement probably came from avoiding to resolve bindings in some situations and using bitmasks for modifiers. If you want to test this then go ahead, but either way we would need upstream changes to LSP4J in order to use primitives, which I don't have time to deal with right now (and I'm not sure if the performance gain is worth it).

After testing this PR again, I noticed that unfortunately redhat-developer/vscode-java#1597 is back, because now we don't have the client-side code in the extension to handle "stale" requests. We could re-implement the same workaround in the extension using LanguageClientOptions.middleware.provideDocumentSemanticTokens, but I really wanted to understand the root cause of this annoying issue, so I went back to debugging.

Finally, I believe I've found the problem! This is what happens when we get a flicker of incorrect highlighting after frequent changes to the document:

The client requests semantic tokens for version 1 of the document.
The server begins computing the request.
The client detects a change to the document and notifies the server.
The server now computes tokens for version 2 instead.
The server completes the request.
The client applies the tokens, but adjusts them according to the state of the document at the version of the initial request (version 1).

The relevant code can be found here. By setting a breakpoint in the VS Code devtools and removing the changes from pendingChanges, I was able to verify that the semantic tokens get applied correctly without those lines of code.

But this isn't necessarily a bug with VS Code. It just expects the semantic tokens returned by the server to be for the exact version of the document at the time of the request, not necessarily the latest version. Knowing this, I just removed the call to waitForLifecycleJobs, which seems to have fixed the issue. It should also provide a pretty large performance improvement for semantic highlighting, since waiting for lifecycle jobs was taking the majority of the compute time for most requests.

However, the issue could still be reproduced if the server updates the document before the AST is computed. It's not something I'm able to reproduce on my machine, and I don't know if it's possible at all to happen in a real-world scenario. But by inserting a simple Thread.sleep(1000); at the top of SemanticTokensHandler.full, the issue once again appears. Again, what happens is that the server receives a request at version 1 of the document, but computes tokens for version 2 which came in during the Thread.sleep. The tokens are correct for the current version of the document, but since VS Code thinks they are for version 1, it tries to "fix" them, which in reality just messes them up.

The only way to work around this seems to be cancelling the request. But returning null will clear all the tokens (causing flickers of no highlighting), so it has to be done another way. Again, we could do it like redhat-developer/vscode-java#1632, via middleware in the languageclient. But I'm wondering whether or not it would be a good idea to implement it on the server side instead. There are some LSP error codes which could be used to cancel the request and avoid clearing the tokens, see microsoft/vscode-languageserver-node#576 and microsoft/vscode-languageserver-node@dae62de. But I think it would require an update to the vscode-languageclient package in the extension.

What do you think? The easiest fix would probably be to apply the old workaround in the extension, but it might be more "correct" to fix the problem on the server side depending on whether or not all clients expect the tokens to be for the version at the time of the request. Can't find anything about that in the spec though.

Eskibear · 2021-07-21T03:35:19Z

Knowing this, I just removed the call to waitForLifecycleJobs, which seems to have fixed the issue. It should also provide a pretty large performance improvement

It makes sense. Good job.

But by inserting a simple Thread.sleep(1000); at the top of SemanticTokensHandler.full, the issue once again appears.

Sorry but I cannot reproduce it. In my test, after copy&paste for a couple of times, in the end vscode seems to always send an extra request to fetch tokens, and LS can return correct results. The only consequence by inserting above line is, a 1000ms flicker before I get correct highlighting.
I'm using latest vscode-insiders(b805d2e94937976bb17d0439f57fcd3a9d423c31), and test with vscode-languageclient@7.0.0 which is used current vscode-java extension.

but it might be more "correct" to fix the problem on the server side depending on whether or not all clients expect the tokens to be for the version at the time of the request.

Agree.

0dinD · 2021-07-25T16:00:29Z

Sorry but I cannot reproduce it. In my test, after copy&paste for a couple of times, in the end vscode seems to always send an extra request to fetch tokens, and LS can return correct results. The only consequence by inserting above line is, a 1000ms flicker before I get correct highlighting.

Yes, that flicker is the issue I was referring to. So only "part two" of the issue, not "part one" (see explanation here).

I just pushed some changes to this PR which should fix this issue. It works by cancelling the request by throwing a ResponseErrorException with the ContentModified error code, if the document changes while semantic tokens are computed. I think sending an error with the ContentModified code is correct based on reading microsoft/vscode-languageserver-node#576 and microsoft/language-server-protocol#584.

You'll also need to use redhat-developer/vscode-java#2000, because the languageclient needed an update (microsoft/vscode-languageserver-node@dae62de). On the old version it just clears all tokens when receiving an error, which causes flickering (microsoft/vscode-languageserver-node#576).

BTW, I realized that we do need to wait for document lifecycle jobs, because otherwise it's possible that the server has received the latest version of the document but hasn't processed it yet, leading to incorrect tokens that don't automatically repair themselves.

Signed-off-by: 0dinD <zerodind@gmail.com>

0dinD · 2021-09-06T19:01:50Z

Test failures seem unrelated, the change in 023a0a1 was just to rebase against master and a6b1f1d passed all tests. When I ran some of the tests locally, they sometimes failed and sometimes passed, at random.

Eskibear · 2021-09-06T20:33:56Z

I'm ok with this PR. Just to ensure we don't have a regression.

As for the flickering issue on vscode, if bumping vscode-languageclient can perfectly fix that, I suggest we get this PR merged, and update languageclient lib in vscode-java.

0dinD · 2021-09-06T21:38:49Z

Not sure if I completely understand your question, so to clarify and recap:

This PR and redhat-developer/vscode-java#2000 started out as just removing the delegate commands and using LSP for semantic tokens instead. In doing so, I discovered that the offset bug with semantic tokens got introduced again, because the previous workaround was in the extension's client-side code.

So I started investigating a more robust solution for how to solve the problem on the server-side, which relies on cancelling the request on the server side by returning an error with the ContentModified code as mentioned here. This is needed because the client expects the result of the semantic tokens command to return tokens for the version of the document at the time of the request, i.e. if the document updates mid-request, the tokens shouldn't. As far as I know this can't be guaranteed in an easy way when using the shared AST provider, that's why I cancel the request. This also allows us to return early, as opposed to "fixing" it in client-side code, which has to wait for the request to finish.

The last problem was that when cancelling the request using the ContentModified error, the tokens would flicker in VS Code (microsoft/vscode-languageserver-node#576). That's why an update to the languageclient library is needed (if you use the old version with this PR you should see flickering when making lots of edits to a document).

So yes, if you merge this PR, also merge redhat-developer/vscode-java#2000 or else there will be flickering. If you want to see the difference for yourself, make sure to completely rebuild the extension when changing the version of the languageclient.

And because I finally understood in entirety the way the offset bug occurred, I believe all edge cases should be covered now, as opposed to the previous workaround in redhat-developer/vscode-java#1632 which just discards the tokens on the client-side and naively uses a timer to "wait out" the document changes before trying again. I can totally see there being edge cases with this workaround where the bug can still happen if a request is abnormally slow.

Eskibear · 2021-09-07T02:01:02Z

But by inserting a simple Thread.sleep(1000); at the top of SemanticTokensHandler.full, the issue once again appears.

There was misunderstanding, that I thought flickering issue was still there even after you apply this PR and update langaugeclient to v7.1.0-next.5. Now it's clear that you are talking about current workaround (delay) doesn't cover all edge cases.

So yes, if you merge this PR, also merge redhat-developer/vscode-java#2000 or else there will be flickering.
I believe all edge cases should be covered now,

This makes sense, as I saw no flickering issue last time when I tried with you both fixes in (#1806 & redhat-developer/vscode-java#2000).

If you want to see the difference for yourself, make sure to completely rebuild the extension when changing the version of the languageclient.

I'll test this PR + languageclient@v7.0.0, see if I can see the difference.

Eskibear · 2021-09-07T02:01:38Z

test this please

Eskibear · 2021-09-07T03:21:09Z

By inserting Thread.sleep(1000) at beginning of SemanticTokensHandler.full(...), I can reproduce the flickering issue with both combinations below, and I don't see difference:

this PR + redhat-developer/vscode-java#2000 (vscode-languageclient@7.1.0-next.5)
this PR + redhat-developer/vscode-java#2000 + downgrade to vscode-languageclient@7.0.0

flickering.mp4

I can totally see there being edge cases with this workaround where the bug can still happen if a request is abnormally slow.

This PR looks good to me, and I'm happy to get it merged as soon as possible. My only concern is, for those unpowerful machines, requests are slow and users might frequently see the flickering issue. This turns out to be a "regression" to them if they don't see flickering at the moment. Any idea to get this coverred?

Let me know if I misunderstand anything.

0dinD · 2021-09-07T09:57:24Z

By inserting Thread.sleep(1000) at beginning of SemanticTokensHandler.full(...), I can reproduce the flickering issue with both combinations below, and I don't see difference:

That to me sounds like you haven't applied this PR (or it hasn't been compiled successfully), I can't reproduce what you're showing in the video even with a Thread.sleep at the head of SemanticTokensHandler.full. I can reproduce what you're showing if I comment out the three documentMonitor.checkChanged(); lines, essentially reverting the offset fix from this PR.

The difference between the languageclient versions should be that when using the old version, the tokens flicker completely off for a second, i.e. you only see TextMate-based highlighting. That's because the old languageclient version clears the tokens when receiving the ContentModified error, whereas the new version simply ignores the incoming tokens but keeps the old.

Could you please try again, making sure to include both PRs, and that the extension and server have been successfully recompiled? If you can still reproduce the error, can you share your settings/environment so I can try to reproduce this?

0dinD · 2021-09-07T23:21:18Z

languageclient 7.1.0-next.5: (works perfectly, unless I revert the offset fix by commenting out some code)

offset-flicker.mp4

languageclient 7.0.0: (flashes of no semantic tokens, notice the package names)

textmate-flicker.mp4

Again, note that when changing the version of the languageclient you need to do a full recompile, i.e. kill the watch task and start the build again after running npm install, otherwise you won't see the bug shown in the second video.

Eskibear · 2021-09-08T03:05:04Z

I'm able to see it now, it's working as expected! (I was probably using an early commit without DocumentMonitor...)

Eskibear · 2021-09-08T03:05:11Z

test this please

Eskibear

LGTM. Kudos to your work.

Eskibear · 2021-09-08T03:22:07Z

CI failed, tracking in #1869

fbricon · 2021-09-08T10:39:52Z

Thanks @0dinD!

Requires eclipse-jdt/eclipse.jdt.ui#335 Closes eclipse-jdtls#1806 Signed-off-by: David Thompson <davthomp@redhat.com>

Requires eclipse-jdt/eclipse.jdt.ui#335 Closes #1806 Signed-off-by: David Thompson <davthomp@redhat.com>

0dinD mentioned this pull request Jun 24, 2021

Move semantic tokens to LSP implementation redhat-developer/vscode-java#2000

Merged

Eskibear reviewed Jun 25, 2021

View reviewed changes

0dinD force-pushed the semantic-tokens-lsp branch from ca21621 to 7047a4d Compare July 18, 2021 11:17

0dinD force-pushed the semantic-tokens-lsp branch 4 times, most recently from 3f62dd5 to a6b1f1d Compare July 25, 2021 15:18

Move semantic tokens to LSP implementation

023a0a1

Signed-off-by: 0dinD <zerodind@gmail.com>

0dinD force-pushed the semantic-tokens-lsp branch from a6b1f1d to 023a0a1 Compare September 6, 2021 16:32

0dinD mentioned this pull request Sep 6, 2021

Since July update cause java highlight break redhat-developer/vscode-java#1597

Closed

Eskibear mentioned this pull request Sep 7, 2021

CI failed #1869

Closed

Eskibear approved these changes Sep 8, 2021

View reviewed changes

fbricon merged commit 00b8336 into eclipse-jdtls:master Sep 8, 2021

0dinD mentioned this pull request Oct 21, 2021

Semantic tokens can still become desynced/offset redhat-developer/vscode-java#2176

Closed

datho7561 added a commit to datho7561/eclipse.jdt.ls that referenced this pull request Nov 18, 2022

QuickFix for annotation missing attributes

4319edb

Requires eclipse-jdt/eclipse.jdt.ui#335 Closes eclipse-jdtls#1806 Signed-off-by: David Thompson <davthomp@redhat.com>

datho7561 mentioned this pull request Nov 18, 2022

QuickFix for annotation missing attributes #2335

Merged

datho7561 added a commit to datho7561/eclipse.jdt.ls that referenced this pull request Dec 7, 2022

QuickFix for annotation missing attributes

35a0be9

Requires eclipse-jdt/eclipse.jdt.ui#335 Closes eclipse-jdtls#1806 Signed-off-by: David Thompson <davthomp@redhat.com>

datho7561 added a commit to datho7561/eclipse.jdt.ls that referenced this pull request Dec 8, 2022

QuickFix for annotation missing attributes

ee94013

Requires eclipse-jdt/eclipse.jdt.ui#335 Closes eclipse-jdtls#1806 Signed-off-by: David Thompson <davthomp@redhat.com>

datho7561 added a commit to datho7561/eclipse.jdt.ls that referenced this pull request Dec 9, 2022

QuickFix for annotation missing attributes

fa58379

Requires eclipse-jdt/eclipse.jdt.ui#335 Closes eclipse-jdtls#1806 Signed-off-by: David Thompson <davthomp@redhat.com>

rgrunber pushed a commit that referenced this pull request Dec 9, 2022

QuickFix for annotation missing attributes

7c7a064

Requires eclipse-jdt/eclipse.jdt.ui#335 Closes #1806 Signed-off-by: David Thompson <davthomp@redhat.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move semantic tokens to LSP implementation #1806

Move semantic tokens to LSP implementation #1806

0dinD commented Jun 24, 2021 •

edited

Eskibear left a comment

Eskibear commented Jul 15, 2021

0dinD commented Jul 15, 2021

0dinD commented Jul 18, 2021

Eskibear commented Jul 21, 2021

0dinD commented Jul 25, 2021 •

edited

0dinD commented Sep 6, 2021

Eskibear commented Sep 6, 2021

0dinD commented Sep 6, 2021 •

edited

Eskibear commented Sep 7, 2021

Eskibear commented Sep 7, 2021

Eskibear commented Sep 7, 2021

0dinD commented Sep 7, 2021 •

edited

0dinD commented Sep 7, 2021 •

edited

Eskibear commented Sep 8, 2021

Eskibear commented Sep 8, 2021

Eskibear left a comment

Eskibear commented Sep 8, 2021

fbricon commented Sep 8, 2021

Move semantic tokens to LSP implementation #1806

Move semantic tokens to LSP implementation #1806

Conversation

0dinD commented Jun 24, 2021 • edited

Eskibear left a comment

Choose a reason for hiding this comment

Eskibear commented Jul 15, 2021

0dinD commented Jul 15, 2021

0dinD commented Jul 18, 2021

Eskibear commented Jul 21, 2021

0dinD commented Jul 25, 2021 • edited

0dinD commented Sep 6, 2021

Eskibear commented Sep 6, 2021

0dinD commented Sep 6, 2021 • edited

Eskibear commented Sep 7, 2021

Eskibear commented Sep 7, 2021

Eskibear commented Sep 7, 2021

0dinD commented Sep 7, 2021 • edited

0dinD commented Sep 7, 2021 • edited

Eskibear commented Sep 8, 2021

Eskibear commented Sep 8, 2021

Eskibear left a comment

Choose a reason for hiding this comment

Eskibear commented Sep 8, 2021

fbricon commented Sep 8, 2021

0dinD commented Jun 24, 2021 •

edited

0dinD commented Jul 25, 2021 •

edited

0dinD commented Sep 6, 2021 •

edited

0dinD commented Sep 7, 2021 •

edited

0dinD commented Sep 7, 2021 •

edited