Autocomplete: Various latency related tweaks and new eager cancellation experiment #3096

philipp-spiess · 2024-02-09T11:31:16Z

A few small tweaks from my learnings of looking at some traces:

Fixes a bug where the debounce time was increased for non-local models
Sets the same debounce time for single-line and multi-line
Remove some config evaluations off the critical path. Those are heavily cached but it would still cause the very first completion to be slower
Add a new eager cancellation experiment that will cancel requests as soon as a new request is created and reduces the debounce time significantly to try and counter the latency regression

Test plan

For the new experiment, I added a abort handler in the fireworks client and ensured it was heavily hit
For the rest, just made sure completions still work. The changes are trivial.

…on experiment

valerybugakov · 2024-02-09T11:57:28Z

vscode/src/completions/inline-completion-item-provider.ts

+            const isEagerCancellationEnabled = completionProviderConfig.getPrefetchedFlag(
+                FeatureFlag.CodyAutocompleteEagerCancellation
+            )
+            const debounceInterval = isLocalProvider ? 125 : isEagerCancellationEnabled ? 10 : 75


🔥 ☄️

valerybugakov · 2024-02-09T12:02:38Z

vscode/src/completions/request-manager.ts

+                        if (!eagerCancellation) {
+                            this.testIfResultCanBeRecycledForInflightRequests(


Can we keep this functionality when eagerCancellation === true? We can throttle (is it the right word here?) requests — with low debounce, we will have them almost on every keystroke, but instead of canceling all of them but the last one, we can keep on request in every number of requests made in the previous 100ms + always keep the tail one.

This way, we preserve the nice UX where the completion is generated early, and a user continues typing as suggested AND decrease the tail completion delay by 65ms. WDYT?

I’m not sure I understand this right. If we want to keep this, we would have to keep more than just the last request alive (so that another request can actually answer another one). Are you recommending we keep all but the last 2 completions active?

Discussed on the call. We're going to follow-up on that in a separate PR.

Autocomplete: Various latency related tweaks and new eager cancellati…

49e897a

…on experiment

philipp-spiess requested review from valerybugakov and a team February 9, 2024 11:31

philipp-spiess self-assigned this Feb 9, 2024

philipp-spiess added 2 commits February 9, 2024 12:32

changelog

bf16502

fixes

b597274

valerybugakov reviewed Feb 9, 2024

View reviewed changes

Fix request manager tests

dcf93ad

valerybugakov approved these changes Feb 9, 2024

View reviewed changes

philipp-spiess merged commit 703afda into main Feb 9, 2024
15 checks passed

philipp-spiess deleted the ps/ac-latency-experiments-and-tweaks branch February 9, 2024 12:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Autocomplete: Various latency related tweaks and new eager cancellation experiment #3096

Autocomplete: Various latency related tweaks and new eager cancellation experiment #3096

philipp-spiess commented Feb 9, 2024

valerybugakov Feb 9, 2024

valerybugakov Feb 9, 2024

philipp-spiess Feb 9, 2024

valerybugakov Feb 9, 2024

		if (!eagerCancellation) {
		this.testIfResultCanBeRecycledForInflightRequests(

Autocomplete: Various latency related tweaks and new eager cancellation experiment #3096

Autocomplete: Various latency related tweaks and new eager cancellation experiment #3096

Conversation

philipp-spiess commented Feb 9, 2024

Test plan

valerybugakov Feb 9, 2024

Choose a reason for hiding this comment

valerybugakov Feb 9, 2024

Choose a reason for hiding this comment

philipp-spiess Feb 9, 2024

Choose a reason for hiding this comment

valerybugakov Feb 9, 2024

Choose a reason for hiding this comment