refactor: update preamble rules and mixin #1442

abeatrix · 2023-10-20T00:48:00Z

Updating current preamble, rules, and prompt mixin with new findings from #907.

Issue 1: Cody hallucinates about file path and code

One of the main issues we saw in Chat is Cody hallucinating about files and code that do not exist in the shared context.

I noticed this is mainly because of how we phrased our context, as Cody do not understand the relationship between each code snippet we share with them. For more information, see #907 (comment)

One of the many prompts I've tried to prevent hallucination is to add Do not make any assumptions about the code and file names or any misleading information to the question, which has been working well for me. This is also the prompt we added to all our commands. A customer has also confirmed that adding this to their questions stopped Cody from hallucinating and received better responses from Chat, this is why I propose adding it to PromptMixin, which will be added to the start of every question (instead of Preabmle, which is added to the start of the session and will get left out as the conversation gets longer.)

I don't think this will be 100% hallucination-proof, but it works a lot better than what we currently have.

Before

lib/share/ui/src/components/CodeBlock is not the correct path

After

make well-informed answers

Issue 2: The current preambles need to be updated, and not working as intended most of the times.

Addressed by this PR with the following changes:

Simplified the preamble actions and rules text to be more concise.
Removed redundant rules that will always be false, e.g. Cody never has direct access to your file or repository even when we tell them they do.
Removed detailed instructions about code formatting and limitations, leaving just the essential rules
Shortened the preamble answer to focus on the core assistant persona and capabilities

Issue 3: Cody will not answer in language that is not the default language of the editor

CLOSE https://github.com/sourcegraph/cody/discussions/1011 && #988

Many users have complained that they cannot get Cody to answer their questions because Cody refused to answer questions that is not the same as the default language of the editor.

Addressed by this PR with the following changes: Remove languagePromptMixin from the activate event for editors, which allows Cody to answer in the same language as the question

Before

After

Test plan

Ask Cody a question in another language that is not the same as your editor. Cody should still be able to answer your question in the same language.
Ask Cody a question in editor, and then ask the same question in Web to compare the quality. Questions where Cody hallucinates on Web should not happen in the editor when using the updated prompts.

Example question to ask in the cody repo: Do we format the completion item before we suggest them?

valerybugakov

Love the idea of adding Never make any assumptions nor provide any misleading or hypothetical examples.' to the prompt!

Asked questions about other changes in the PR.

valerybugakov · 2023-10-20T01:36:10Z

lib/shared/src/chat/preamble.ts

- If you do not have access to a repository, tell me to add additional repositories to the chat context using repositories selector below the input box to help you answer the question.
- Only reference file names, repository names or URLs if you are sure they exist.`
+const multiRepoRules = `Important rules to follow in all your responses:
+- All code snippets must be markdown-formatted, and enclosed in triple backticks.


What's the rationale for removing the triple backticks example?

From my experience, It is harder to tell Claude not to return code with backticks than including them lol It was a big issue for fixup because it would includes the backticks in the edited code 😓
I don't think Claude has any problem connecting triple backticks with markdown, so the example IMO is unnecessary. Here is an example:

valerybugakov · 2023-10-20T01:38:03Z

vscode/src/extension.common.ts

@@ -39,7 +39,7 @@ export interface PlatformContext {

 export function activate(context: vscode.ExtensionContext, platformContext: PlatformContext): ExtensionApi {
    const api = new ExtensionApi()
-    PromptMixin.add(languagePromptMixin(vscode.env.language))
+    PromptMixin.add(defaultPromptMixin())


Is there a way to A/B test prompt changes as we do with autocomplete improvements?

I think we have one but not sure if it's active 😅

valerybugakov · 2023-10-20T01:39:41Z

lib/shared/src/prompt/prompt-mixin.ts

+ */
+export function defaultPromptMixin(): PromptMixin {
+    const identity = 'Reply as Cody, a coding assistant developed by Sourcegraph.'
+    const hallucinate = 'Never make any assumptions nor provide any misleading or hypothetical examples.'


It would be interesting to test how this sentence affects CAR.

Haha we can put it behind a feature flag.

Can't wait to move to Starcoder so we don't have to worry about this 😏

Can't wait to move to Starcoder so we don't have to worry about this 😏

May I remind you of https://huggingface.co/bigcode/starcoder/discussions/50#647f8078222b8c6a6f3e1c27 😆

valerybugakov · 2023-10-20T01:42:30Z

lib/shared/src/chat/preamble.ts

-const answer = `Understood. I am Cody, an AI assistant made by Sourcegraph to help with programming tasks.
-I work inside a text editor. I have access to your currently open files in the editor.
+const answer = `Understood. I am Cody, an AI assistant developed by Sourcegraph to help with programming tasks.
+I am working with you inside an editor, and I will answer your questions based on the context you provide from your current codebases.


Interesting, did you find that the present continuous tense works better for the prompt?

I do. I actually learnt this from Claude's docs: https://docs.anthropic.com/claude/docs/advanced-text-analysis

They also mentioned somewhere in their docs that it is better to speak to Claude in first person instead of third :)

Nice, thanks for sharing the doc!

jdorfman · 2023-10-20T15:37:05Z

This is a massive win for our users. Thanks, @abeatrix for digging in!

refactor: update preamble rules and mixin

558963f

abeatrix requested a review from a team October 20, 2023 00:48

valerybugakov approved these changes Oct 20, 2023

View reviewed changes

abeatrix merged commit c733de3 into main Oct 23, 2023
14 checks passed

abeatrix deleted the bee/new-rules branch October 23, 2023 18:48

abeatrix mentioned this pull request Oct 27, 2023

[Tracking Issue]: Refactor Chat to No-magic Chat #1410

Closed

abeatrix added clients/vscode chat/commands Chat and Commands labels Oct 27, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: update preamble rules and mixin #1442

refactor: update preamble rules and mixin #1442

abeatrix commented Oct 20, 2023

valerybugakov left a comment

valerybugakov Oct 20, 2023

abeatrix Oct 20, 2023

valerybugakov Oct 20, 2023

abeatrix Oct 20, 2023

valerybugakov Oct 20, 2023

abeatrix Oct 20, 2023

valerybugakov Oct 20, 2023

philipp-spiess Oct 20, 2023

valerybugakov Oct 20, 2023

abeatrix Oct 20, 2023

valerybugakov Oct 20, 2023

jdorfman commented Oct 20, 2023

refactor: update preamble rules and mixin #1442

refactor: update preamble rules and mixin #1442

Conversation

abeatrix commented Oct 20, 2023

Before

After

Before

After

Test plan

valerybugakov left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jdorfman commented Oct 20, 2023