
Cody: Add support for server-side token limits to Chat #54488

Merged 8 commits into main from ps/cody-limits on Jul 5, 2023

Conversation

@philipp-spiess (Contributor) commented on Jun 30, 2023:

This is a follow-up to @mrnugget's PR that added a new site config API to expose server-side token limits.

For now, we only implement this for the chat providers, and we use hardcoded response limits.
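
For context, a rough sketch of the resolution order discussed in this thread: a local config overwrite takes precedence, then the server-provided chatModelMaxTokens, then a default, with the hardcoded response limit subtracted. ANSWER_TOKENS and chatModelMaxTokens appear in the PR; the function name, the fallback constant, and the numeric values below are assumptions, not the extension's actual code:

// Illustrative sketch only; values and names other than ANSWER_TOKENS/chatModelMaxTokens are assumptions.
const ANSWER_TOKENS = 1000 // assumed hardcoded response (answer) limit
const DEFAULT_CHAT_MODEL_MAX_TOKENS = 7000 // assumed fallback context window

function maxPromptTokens(
    localMaxTokens: number | undefined,
    siteChatModelMaxTokens: number | undefined
): number {
    // Local configuration takes precedence, to stay backward compatible for now.
    const contextWindow = localMaxTokens ?? siteChatModelMaxTokens ?? DEFAULT_CHAT_MODEL_MAX_TOKENS
    // Reserve room for the model's answer within the overall context window.
    return contextWindow - ANSWER_TOKENS
}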

Test plan

Defaults

[screenshot: 2023-06-30 14:23]

With local config overwrite (to stay backward compatible for now)

[screenshot: 2023-06-30 14:24]

With site config setting

[screenshots: 2023-06-30 14:29 and 14:26]

@philipp-spiess self-assigned this on Jun 30, 2023
@cla-bot added the cla-signed label on Jun 30, 2023
@sourcegraph-bot (Contributor) commented on Jun 30, 2023:

📖 Storybook live preview

Comment on lines 219 to 222
    return this.fetchSourcegraphAPI<APIResponse<any>>(CURRENT_SITE_CODY_LLM_CONFIGURATION, {}).then(response =>
        extractDataOrError(response, data => data.site?.codyLLMConfiguration)
    )
}
A Contributor commented:

What if the server is still on 5.0? I think we need to be backwards compatible and make this more generic, so that it also checks for other fields:

public async getSiteHasIsCodyEnabledField(): Promise<boolean | Error> {
    return this.fetchSourcegraphAPI<APIResponse<SiteGraphqlFieldsResponse>>(
        CURRENT_SITE_GRAPHQL_FIELDS_QUERY,
        {}
    ).then(response =>
        extractDataOrError(response, data => !!data.__type?.fields?.find(field => field.name === 'isCodyEnabled'))
    )
}

@philipp-spiess (Author) commented:

I think this should work, but I haven't tested it. Do you know if we have a hosted instance I can use, or do I have to reset my dev box to test?

We gracefully "ignore" errors and make sure that the value returned from GraphQL here can be undefined, too.

A Contributor commented:

You can easily run the Docker images:

docker run --publish 7080:7080 --publish 127.0.0.1:3370:3370 --rm --volume ~/.sourcegraph-docker/config:/etc/sourcegraph --volume ~/.sourcegraph-docker/data:/var/opt/sourcegraph sourcegraph/server:5.0

This runs 5.0, for example.

Or you can comment out this line in your local dev instance:

codyLLMConfiguration: CodyLLMConfiguration

That will disable the resolver.

As for "it should work": it might work client-side, but I do think it'll produce errors on the backend. And while we don't have extensive error reporting on response codes today, I don't think we should rely on server errors not bubbling up in the client if we're already fetching the schema anyway.

A Member commented:

Side comment: it should be a 4xx error, so I don't feel terribly strongly about this.

@valerybugakov (Member) commented on Jul 5, 2023:

With the resolver disabled on the local instance, the API returns a 200 response with the errors object. Let's address this in a follow-up if needed, because it's unclear whether the current behavior negatively affects the backend.

{
  "errors": [
    {
      "message": "Cannot query field \"codyLLMConfiguration\" on type \"Site\". Did you mean \"configuration\"?",
      "locations": [
        {
          "line": 4,
          "column": 9
        }
      ]
    }
  ]
}
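
For illustration, one way the client could treat this error shape as "field not available" rather than a hard failure is sketched below. The helper name codyLLMConfigurationOrUndefined and the GraphQLErrorResponse type are hypothetical and not part of this PR; only the error message string comes from the response above:

interface GraphQLErrorResponse {
    data?: { site?: { codyLLMConfiguration?: unknown } }
    errors?: { message: string }[]
}

// Hypothetical helper: treat the "Cannot query field" error from older
// instances as "configuration not exposed" and return undefined instead
// of surfacing an error.
function codyLLMConfigurationOrUndefined(response: GraphQLErrorResponse): unknown {
    const fieldMissing = response.errors?.some(error =>
        error.message.includes('Cannot query field "codyLLMConfiguration"')
    )
    if (fieldMissing) {
        return undefined
    }
    return response.data?.site?.codyLLMConfiguration
}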

const authStatus = this.authProvider.getAuthStatus()

if (authStatus.configOverwrites?.chatModelMaxTokens) {
    return authStatus.configOverwrites.chatModelMaxTokens - ANSWER_TOKENS
A Member commented:

Do we intentionally ignore const solutionLimit = codyConfig.get<number>('provider.limit.solution') || ANSWER_TOKENS here?

A Member commented:

We should add a small safety buffer of 100 tokens here, because we only do estimations and they might be slightly off. The backend should still return the true values.

A Member commented:

Also, local config should probably take precedence?

A Member commented:

"local config should probably take precedence?"

I would expect that.

@valerybugakov (Member) commented on Jul 5, 2023:

"we should add a small safety buffer of 100 tokens here, because we only do estimations and they might be slightly off."

@eseliger could you elaborate on that?

Update: I added a buffer, but I need clarification on whether this is what we need 🙂

@valerybugakov (Member) left a comment:

Hey @mrnugget and @eseliger, I prepped the PR for merge. Let me know if it looks good to you.

    return tokenLimit - localSolutionLimit
}

// TODO: add comment on why SAFETY_ANSWER_TOKENS is required here.
A Member commented:

This safety threshold is actually for the prompt, not the answer.

The problem with a token limit for the prompt is that we can only estimate tokens (and do so in a very cheap way), so we may undercount them. If we exceed the maximum tokens, things will start to break, so we should have some safety cushion for when our estimate is wrong.

I.e.: take a long text of 10,000 characters that we estimate to be 2,500 tokens. That would easily fit into a limit of 3,000 tokens. But if it's actually 3,500 tokens, because it splits weirdly and our estimation is off, the request will fail. That's where we want to add this safety cushion :)
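
A minimal sketch of that idea, assuming the ~4-characters-per-token ratio implied by the example above and the 100-token buffer mentioned earlier; estimateTokens, SAFETY_PROMPT_TOKENS, and fitsInPromptLimit are illustrative names, not the extension's actual code:

// Very cheap token estimation: roughly 4 characters per token (assumed ratio).
function estimateTokens(text: string): number {
    return Math.ceil(text.length / 4)
}

// Hypothetical safety cushion to absorb under-counting in the cheap estimate.
const SAFETY_PROMPT_TOKENS = 100

function fitsInPromptLimit(text: string, maxPromptTokens: number): boolean {
    // Reserve the cushion so a slightly-off estimate does not exceed the model's real limit.
    return estimateTokens(text) + SAFETY_PROMPT_TOKENS <= maxPromptTokens
}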

@valerybugakov (Member) commented on Jul 5, 2023:

Thanks, Erik; I added your explanation as a comment in the source!

@eseliger (Member) left a comment:

Other than my inline comment about what the cushion is for, LGTM. Thank you, Philipp and Valery!

@valerybugakov enabled auto-merge (squash) on July 5, 2023 at 10:05
@valerybugakov self-assigned this on Jul 5, 2023
@valerybugakov merged commit 29b8d4e into main on Jul 5, 2023
13 of 16 checks passed
@valerybugakov deleted the ps/cody-limits branch on July 5, 2023 at 10:16
@philipp-spiess (Author) commented:

Thanks for fixing this up for me @valerybugakov :)
