x/tools/gopls: express completion scoring in terms of down-ranking factors #63282
Comments
I'd be curious to see how often the chosen completion candidate is a deep candidate. Do we have any data on that? It would also be good to know at what point in our search we typically find the winning candidate (e.g. do we normally find it during the first 10ms of deep searching?).
Stopping earlier for leaf objects makes sense, but for intermediate objects like structs it is hard to stop early since one of the struct fields could be the needle in the haystack (and the intermediate objects are what explode the search space). Whether it is score or something else, using heuristics to prune/limit the search space would certainly help a lot.
Hopefully most of the ad hoc score adjustments have
@muirdm thanks for the response. Responses inline:
No! Until very recently (i.e. the last two weeks) we had no telemetry in gopls that could measure these things across a larger sample size. However, we are adding opt-in telemetry in the next release, and have already instrumented completion latency (#62665 (comment)), which I believe would have immediately highlighted the regression of #62665. I think we can also instrument selected completion items (provided the instrumentation contains no information other than metadata such as "is deep"). I think we need to gather data from real usage, because synthetic usage (or even the Go team's usage) tends to be very skewed.
Indeed. We always include all first-level struct fields (that was the "fix" that led to the regression of #62665). However, you are right that we may want to continue searching children of the current candidate even if the candidate is excluded. We'd have to refactor to more cleanly separate searching from scoring.
Yes, there are a good number of @rank tests, though I think we should generate more before starting to tweak the scoring.
Indeed, this is on my list of things to do. I've been porting the completion tests from the old marker framework to the new marker framework, and was struck by how heavyweight the @item annotations are. In many cases, we only care about the existence and rank of candidates.
While investigating #62665, I noticed that there were lots of opportunities to optimize deep completion (see also #63263). However, one obstacle to optimization is the fact that completion candidate score adjustment occurs in multiple places, and is a combination of multiplication by numbers >1 and <1, explicit setting of score, and subtraction.
It's clear that a lot of thought and experimentation went into the current scoring, and it works well overall. However, the way scoring is expressed makes it hard to reason about and hard to refactor (because operations don't commute), and therefore hard to improve.
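To illustrate the non-commutativity, here is a small, purely hypothetical example (not actual gopls code) showing that mixing subtraction and multiplication makes the result depend on evaluation order:

```go
package main

import "fmt"

func main() {
	// The same three adjustments, applied in two different orders.
	a := 1.0
	a *= 1.5 // a boost (>1)
	a -= 0.3 // a penalty by subtraction
	a *= 0.5 // a penalty by multiplication (<1)

	b := 1.0
	b *= 0.5
	b -= 0.3
	b *= 1.5

	fmt.Println(a, b) // the results differ, so reordering adjustments changes rankings
}
```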
Additionally, it means that we can never determine that the score is already too low and stop doing work. For example, in one of our kubernetes benchmarks we spend approximately 80ms in type checking and "normal" completion combined, and then another 100-200ms doing deep completion. All of that extra work produces at most 3 additional candidates. It is likely that by doing cheaper scoring operations first, we could immediately invalidate most of the candidates without having to do more expensive operations such as fuzzy matching and type inference.
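As a sketch of the kind of early cutoff this would enable (the candidate type and the helpers cheapFactors, fuzzyMatchFactor, and inferredTypeFactor below are hypothetical stand-ins, not existing gopls API):

```go
// candidate is a hypothetical stand-in for a completion candidate.
type candidate struct {
	baseScore float64
	score     float64
}

// shouldKeep applies cheap down-ranking factors first and abandons the
// candidate as soon as its score falls below cutoff, so that expensive
// steps such as fuzzy matching and type inference only run for
// candidates that can still place in the results.
func shouldKeep(cand *candidate, cutoff float64) bool {
	s := cand.baseScore
	for _, f := range cheapFactors(cand) { // cheap checks, e.g. depth or deprecation
		s *= f
		if s < cutoff {
			return false // already too low; skip the expensive work
		}
	}
	s *= fuzzyMatchFactor(cand)   // expensive
	s *= inferredTypeFactor(cand) // expensive
	cand.score = s
	return s >= cutoff
}
```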
I therefore believe that a first step toward improving completions should be to enforce the following rule: scores only go down, by multiplicative factors. In other words:
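(The following is a sketch of that rule; the names are illustrative rather than actual gopls code.)

```go
// score applies only down-ranking factors, each in the range [0, 1],
// to a candidate's base score. Because every adjustment is a
// multiplication by a factor <= 1, the adjustments commute and the
// score can only decrease.
func score(base float64, factors ...float64) float64 {
	s := base
	for _, f := range factors { // invariant: 0 <= f <= 1
		s *= f
	}
	return s
}
```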
I think we can make this change in the existing codebase without changing anything else: each scoring operation needs to be recalibrated to express its outcome using a down-ranking factor.
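For example (a hypothetical adjustment with made-up numbers), a penalty that is currently applied by subtraction might be recalibrated into a factor with a comparable effect:

```go
// Before: a penalty by subtraction, whose effect depends on when it runs
// relative to the other adjustments.
func penalizeUnimportedOld(score float64) float64 { return score - 0.1 }

// After: the same intent expressed as a down-ranking factor in [0, 1],
// which commutes with every other factor.
func penalizeUnimportedNew(score float64) float64 { return score * 0.9 }
```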
Unfortunately, there is a lot of knowledge embedded in the current scoring, and therefore making this change naively is likely to result in inferior completions in the short term. @pjweinb has been looking at some large scale testing that may be useful for calibrating completion results, and may help us make this change without regression (or, perhaps, while simultaneously improving results).
CC @adonovan @muirdm @heschi, who have experience with the completion code. WDYT?