Autocomplete: Fix retrieval hints #2652

philipp-spiess · 2024-01-10T10:23:27Z

I noticed that the context size hints returned values suffixed with chars but one of the values was not going through the same tokensToChars estimation.

This value was used by the context mixer and the retrieval strategies. Luckily, the Jaccard similarity strategy was not using that value. However, the context mixer was likely truncating results because of this, so I would suspect this to increase the average context length a bit (which could have a negative impact on performance). However there was another bug that counter-acted this change a bit. The previous totalFileContextChars was used to only count the additional code snippets (so excluding prefix and suffix) while the documentation said that this number is inclusive.

Also, the actual truncation happens at the prompt level and is unaffected by these hints. These only tell the context retriever how much context it should retrieve.

Some math shows us that the actual difference is not big:

Before:

maxRetrievedChars = maxContextTokens * 0.9

After:

maxRetrievedChars = 4 *  maxContextTokens * 0.9 - 4 * maxContextTokens * 0.6 - 4 * maxContextTokens * 0.1
maxRetrievedChars = 4 * maxContextTokens * ( 0.9 - 0.6 - 0.1 )
maxRetrievedChars = 4 * maxContextTokens * ( 0.2 )
maxRetrievedChars = maxContextTokens * 0.8

So yeah, almost the same 😅

However, with the new change, if fewer actual suffix or prefix characters are used, we will retrieve more context to fill up the window (which IMO is a good thing)

Test plan

Enable the feature flag
Make some autocomplete request
Observe that context snippers are still added.

philipp-spiess · 2024-01-10T10:26:10Z

vscode/src/completions/providers/provider.ts

+        totalFileContextChars: Math.floor(tokensToChars(0.9 * maxContextTokens)), // keep 10% margin for preamble, etc.
        prefixChars: Math.floor(tokensToChars(0.6 * maxContextTokens)),
        suffixChars: Math.floor(tokensToChars(0.1 * maxContextTokens)),


Somehow this doesn't add up to 100% haha, let me run some more tests to make sure this still works as expected

Updated the PR descriptions with my findings. This was a fun one!

…into ps/fix-retrieval-hints

valerybugakov

Nice find!

…into ps/fix-retrieval-hints

Autocomplete: Fix retrieval hints

3ccf218

philipp-spiess requested review from valerybugakov and a team January 10, 2024 10:23

philipp-spiess self-assigned this Jan 10, 2024

philipp-spiess added 2 commits January 10, 2024 11:24

Add change log

05fdaa2

Merge branch 'main' into ps/fix-retrieval-hints

3cf116f

philipp-spiess commented Jan 10, 2024

View reviewed changes

philipp-spiess marked this pull request as draft January 10, 2024 10:26

Fix bugs

ec4da50

philipp-spiess marked this pull request as ready for review January 10, 2024 10:50

philipp-spiess added 2 commits January 10, 2024 11:54

Merge branch 'ps/fix-retrieval-hints' of github.com:sourcegraph/cody …

6be3808

…into ps/fix-retrieval-hints

Fixes

a79231b

abeatrix approved these changes Jan 10, 2024

View reviewed changes

Update tests

87bd3a3

valerybugakov approved these changes Jan 11, 2024

View reviewed changes

philipp-spiess added 6 commits January 11, 2024 10:27

Merge branch 'main' into ps/fix-retrieval-hints

5c44976

Fixes

91d02d5

Merge branch 'ps/fix-retrieval-hints' of github.com:sourcegraph/cody …

ce9c3b2

…into ps/fix-retrieval-hints

Merge remote-tracking branch 'origin/main' into ps/fix-retrieval-hints

39ed13c

Fix change log

8bd2bcd

?? how

44a77e9

philipp-spiess merged commit 97f9617 into main Jan 11, 2024
15 checks passed

philipp-spiess deleted the ps/fix-retrieval-hints branch January 11, 2024 15:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Autocomplete: Fix retrieval hints #2652

Autocomplete: Fix retrieval hints #2652

philipp-spiess commented Jan 10, 2024 •

edited

Loading

philipp-spiess Jan 10, 2024

philipp-spiess Jan 10, 2024

valerybugakov left a comment

Autocomplete: Fix retrieval hints #2652

Autocomplete: Fix retrieval hints #2652

Conversation

philipp-spiess commented Jan 10, 2024 • edited Loading

Test plan

philipp-spiess Jan 10, 2024

Choose a reason for hiding this comment

philipp-spiess Jan 10, 2024

Choose a reason for hiding this comment

valerybugakov left a comment

Choose a reason for hiding this comment

philipp-spiess commented Jan 10, 2024 •

edited

Loading