Cody completions: Setup to Bring-your-own-LLM #53495
Conversation
Force-pushed from c5c3208 to aa0d816
Force-pushed from aa0d816 to 0bfcd3e
This behavior was forked off to reduce complexity in the other files. We'll update this to use the new providers interface as a follow-up.
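The thread doesn't show what the new providers interface looks like, so here is a minimal, purely illustrative sketch of a pluggable completion provider. Every name in it is an assumption, not the actual Cody API:

```typescript
// Hypothetical sketch only: none of these names are confirmed by the PR.
interface CompletionProvider {
    /** Identifier used in config, e.g. "anthropic" or "unstable-codegen". */
    readonly id: string

    /** Upper bound on tokens this provider can spend on shared context. */
    readonly maximumContextTokens: number

    /** Produce completion candidates for the text around the cursor. */
    generateCompletions(prefix: string, suffix: string, abortSignal: AbortSignal): Promise<string[]>
}
```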
undefined,
undefined,
Excellent, it's getting smaller!
}

CompletionLogger.noResponse(logId)
return []
}

public async fetchAndShowManualCompletions(): Promise<void> {
Noice!
Great stuff, Philipp! 🔥
Co-authored-by: Valery Bugakov <skymk1@gmail.com>
This PR restructures the way we generate completions so that it is easy and cheap for us to try out different completion providers. The existing behavior was kept mostly the same, but some minor tweaks were made to reduce complexity:

- The `getContext` function now gets a larger character budget. We previously calculated the _exact length_ available for context by generating an empty prompt containing only the prefix and suffix, which caused quite a bit of overhead and back and forth to negotiate the right value. The new behavior uses the total context length as an upper bound, so context is now slightly over-fetched, but in practice this should not be a big deal. The individual providers still need to make sure they only include context when it fits (logic that was already implemented); see the trimming sketch after this description.
- The Anthropic answer tokens were changed from `200` to `204`, which allows us to move from hardcoded numbers to a percentage: 10% of the total tokens are now used for the answer (see the token-split sketch below).

## Controversial

- The logic where an inline completion on a non-empty line caused 2 normal requests _and one request with a `\n` injected into the prompt_ was removed for now. Do we want it back? If so, it should be branched off in the Anthropic provider only.
- My personal take is that it was always confusing to me why we did this, and I have not _personally_ observed it causing any major gains. I'm fine with dropping it for now and adding it back if we see issues, but I suspect that a different model will bring us more gains.

## ToDo

(Will be addressed before merge but does not need to block a first code review.)

- [ ] Add proper context to the backend API (this will be added in another PR)
- [x] Increase the response token size for multi-line completions and maybe reduce it for single-line completions (to improve latency)
- [x] Add request logging so we know what the fetch is doing
- [x] Add logging on startup to show which backend is being used
- [x] Handle the abort case
- [x] Check why there are 4 requests

## Test plan

- Test like you normally would. Nothing should have changed!
- To test a new provider:
  - First start the mock server: `cd client/cody && pnpm ts-node ./scripts/mock-server.ts` (a minimal sketch of such a server follows at the end of this description)
  - Then add the following two options to your config:

    ```
    "cody.completions.advanced.provider": "unstable-codegen",
    "cody.completions.advanced.serverEndpoint": "http://localhost:3001/batch"
    ```

  - Observe that it works 😮
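The "over-fetch then trim" approach from the first bullet above could look roughly like the following sketch. `ContextSnippet`, `estimateTokens`, and `trimContextToBudget` are illustrative names, not the actual Cody APIs:

```typescript
// Sketch of over-fetching context and trimming it to a token budget.
interface ContextSnippet {
    fileName: string
    content: string
}

// Rough token estimate; ~4 characters per token is a common heuristic.
function estimateTokens(text: string): number {
    return Math.ceil(text.length / 4)
}

// Keep adding snippets only while they fit within the remaining budget,
// so over-fetched context is dropped instead of overflowing the prompt.
function trimContextToBudget(snippets: ContextSnippet[], budgetTokens: number): ContextSnippet[] {
    const kept: ContextSnippet[] = []
    let used = 0
    for (const snippet of snippets) {
        const cost = estimateTokens(snippet.content)
        if (used + cost > budgetTokens) {
            continue // this snippet would overflow the budget; skip it
        }
        used += cost
        kept.push(snippet)
    }
    return kept
}
```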
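The `200` to `204` tweak is just arithmetic; a minimal sketch, assuming the total token budget is 2048 (the figure implied by 204 being roughly 10% of the total):

```typescript
// Percentage-based token split. The 2048 total is an assumption based on
// 204 being described as 10% of the total tokens.
const TOTAL_TOKENS = 2048
const ANSWER_SHARE = 0.1

const answerTokens = Math.floor(TOTAL_TOKENS * ANSWER_SHARE) // 204
const promptAndContextTokens = TOTAL_TOKENS - answerTokens   // 1844
```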
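For context on the test plan, a mock server compatible with the `http://localhost:3001/batch` endpoint can be very small. This is a sketch, not the real `scripts/mock-server.ts`; in particular, the response shape (`completions: [...]`) is an assumption:

```typescript
import * as http from 'http'

// Sketch of a mock completion backend; the real scripts/mock-server.ts
// defines the actual request/response contract.
const server = http.createServer((req, res) => {
    if (req.method === 'POST' && req.url === '/batch') {
        let body = ''
        req.on('data', chunk => {
            body += chunk
        })
        req.on('end', () => {
            console.log('completion request received:', body.slice(0, 200))
            res.writeHead(200, { 'Content-Type': 'application/json' })
            // Canned response; the shape is an assumption, not the real contract.
            res.end(JSON.stringify({ completions: ['console.log("hello from the mock")'] }))
        })
        return
    }
    res.writeHead(404)
    res.end()
})

server.listen(3001, () => console.log('mock completion server on http://localhost:3001'))
```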