
fix: preamble leak for anthropic #1274

Merged
merged 2 commits into main from bee/fix-preamble-leak on Oct 3, 2023

Conversation

@abeatrix (Contributor) commented Oct 2, 2023

Closes #1291

Fix issues that cause the preamble to leak for Anthropic by enclosing the code with backticks, so that Claude Instant knows where the code ends. Previously, code snippets could end with a newline or with unfinished code or a comment, which confused Claude into thinking the code was incomplete, so it would continue the completion from the end of the code snippet instead of completing the code inside the tags.
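To illustrate the idea, here is a rough sketch of a fenced Human prompt (this is not the actual Cody prompt code; the function name and tag values are assumptions based on identifiers mentioned later in this thread):

```ts
// Sketch only (assumed identifiers): fencing the snippet with backticks gives
// Claude an explicit end-of-code marker, so an unfinished line or trailing
// comment no longer looks like code it should keep writing past the snippet.
const OPENING_CODE_TAG = '<CODE>'   // placeholder values, not Cody's actual tags
const CLOSING_CODE_TAG = '</CODE>'

function buildHumanMessage(infillPrefix: string, infillSuffix: string): string {
    return [
        'Complete the code inside the tags below.',
        '```',
        `${infillPrefix}${OPENING_CODE_TAG}${CLOSING_CODE_TAG}${infillSuffix}`,
        '```',
    ].join('\n')
}
```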

Before

[Screenshot 2023-10-02 at 3:27:09 PM]

After

[Screenshot 2023-10-02 at 3:28:30 PM]

Test plan

  1. Build Cody from this branch.
  2. Go to line 213 of vscode/webviews/Chat.tsx in your editor.
  3. Where Cody previously suggested a completion starting with Human:, it should no longer suggest anything that starts with Human: (here or anywhere else).
  4. You should still get suggestions from Cody in other appropriate places.

@valerybugakov (Member) left a comment

It looks logical and solves the issue we found, but I wonder if it affects the output unexpectedly (like when we found out that Claude doesn't like trailing spaces). Did you have a chance to run this change against our completions dataset?

@philipp-spiess (Contributor) commented

Agree with Valery. I’m a bit concerned too about shipping this so shortly before the release. Maybe as a quick fix we can add a filter for completions from Anthropic that start with Human:, and then we can land this after the release and test it a bit more?

Unrelated, but instead of \n<--End of code-->, did you ever try to wrap everything in an XML tag, like so?

```ts
<code>${infillPrefix}${OPENING_CODE_TAG}${CLOSING_CODE_TAG}${infillSuffix}</code>`
```
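For reference, a minimal sketch of the kind of post-release quick fix suggested above (the constant and function names are hypothetical, not Cody's actual post-processing code):

```ts
// Hypothetical guard (not Cody's actual post-processing): drop completions
// that leak the chat preamble by starting with an Anthropic turn marker.
const LEAKED_TURN_MARKERS = ['Human:', 'Assistant:']

function dropPreambleLeaks(completions: string[]): string[] {
    return completions.filter(completion => {
        const trimmed = completion.trimStart()
        return !LEAKED_TURN_MARKERS.some(marker => trimmed.startsWith(marker))
    })
}
```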

@abeatrix (Contributor, Author) commented Oct 3, 2023

> Agree with Valery. I’m a bit concerned too about shipping this so shortly before the release. Maybe as a quick fix we can add a filter for completions that start with Human: from Anthropic and then we can land this after the release and test it a bit more?
>
> Unrelated but instead of \n<--End of code--> did you ever try to wrap everything in an xml tag like so?
>
> <code>${infillPrefix}${OPENING_CODE_TAG}${CLOSING_CODE_TAG}${infillSuffix}</code>`

I tried that at the beginning but couldn't get it to work. I'm planning to spend some more time later to see if we can improve the prompt, but for now I am enclosing the prompt with backticks like we do in Chat. I ran the test suite, and it seems like adding the ${infillBlock} between the XML tags works better, but since this is an LLM, the output is going to be slightly different anyway. Let me know if you think we should remove the ${infillBlock} from the Human prompt and just enclose the code with backticks for now. That would at least solve the issue where Claude doesn't know whether the code snippet has ended or not 😄
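For clarity, a rough sketch of the two prompt shapes being weighed here (identifiers and wording are assumptions for illustration, not the actual prompt):

```ts
// Variant A (assumed wording): only fence the snippet with backticks.
const withBackticksOnly = (infillBlock: string): string =>
    ['Here is the code snippet:', '```', infillBlock, '```'].join('\n')

// Variant B (assumed wording): also keep the infill block between explicit tags,
// which the test-suite run mentioned above seemed to favor.
const withInfillBetweenTags = (infillBlock: string): string =>
    ['Here is the code snippet:', '```', `<snippet>${infillBlock}</snippet>`, '```'].join('\n')
```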

@abeatrix (Contributor, Author) commented Oct 3, 2023

As discussed with @philipp-spiess on Slack, we will merge this fix and make additional changes in patches if required.

@abeatrix merged commit 9602794 into main on Oct 3, 2023
12 checks passed
@abeatrix deleted the bee/fix-preamble-leak branch on October 3, 2023 at 15:21
Development

Successfully merging this pull request may close these issues.

Autocomplete: preamble leak bug