-
Notifications
You must be signed in to change notification settings - Fork 1.3k
Tell Cody to not reveal they're created by Anthropic #52298
Conversation
First, simple attempt to fix https://github.com/sourcegraph/sourcegraph/issues/52061 Looks like it might work. Since it's only a single line, worth a shot?
Bundle size report 📦
Look at the Statoscope report for a full comparison between the commits 6241d4d and d08b09e or learn more. Open explanation
|
|
It is incredibly cheesy, but since Anthropic tends to respond with a set phrase when it goes off the rails, can we just grep for and suppress that phrase in the output? (Note, it is different for different languages, but doing it for English is a start.) Another idle thought, maybe we should add a metric for how often the output mentions "Anthropic". (Although I have certainly asked Cody to write code to call the Anthropic API, which would be a false positive. Interestingly it has memorized an old URL and parameter names of theirs...) |
|
I took a look at this with my prompt testing tool and I think something like this might be effective. Caveat: My snapshot of the Cody prompt might be out of date, and I am testing temperature 0.2, would need to look at what our current environment is like. I could usually suppress that specific Anthropic apology this way: This longer variation also seemed effective: |
|
Nice, @dominiccooney, that seemed to fix it! |
|
So far it's pretty reliable and it's now only the @dominiccooney's prompt. Should we go with it? |
First, simple attempt to fix https://github.com/sourcegraph/sourcegraph/issues/52061 Looks like it might work. Since it's only a single line, worth a shot?  ## Test plan - Manual testing by asking Cody "who are you?" "who created you?" "how are though" (from original ticket)
First, simple attempt to fix https://github.com/sourcegraph/sourcegraph/issues/52061
Looks like it might work. Since it's only a single line, worth a shot?
Test plan