Description
This issue aggregates some work I'd like to do to refine the go/packages.Load
calls made by gopls. Generally speaking, these fall into the category of "using gopls' knowledge of the workspace state to refine loading patterns". This should benefit users across the board, but may in particular help for users of go/packages drivers that have different performance characteristics than the go command.
A word on terminology: we refer to the packages data returned by go/packages.Load
as package metadata.
Considering only the simple case of a module workspace, metadata loading works approximately as follows (some edge cases are ignored). All loads are inclusive of -deps
.
- On startup, load
<modulePath>/...
- When a file change affects metadata, expand that change to all "possibly affected" packages, which includes
- all packages containing the file
- if the package clause changed, all packages with files in the same directory as the file
- all packages that have import cycles (it's too hard to determine whether the change could have resolved cycles)
- When metadata is invalidated for a package, also invalidate its importers, recursively (the reverse transitive closure)
- When processing a file-oriented request such as completion, if we have no package metadata containing that file, load a
file=
query (but keep track of files that can't be loaded so we don't keep trying). - When processing a workspace-wide request such as references, load all the packages invalidated by steps (2) and (3) with a query containing all the invalidated package paths.
This loading pattern evolved to ensure we don't miss metadata changes, and can operate on a file as soon as possible. Notably, it was only chosen based on the go command behavior, and generally works because go list
is heavily optimized. Even loading a relatively large project such as Kubernetes takes only a few seconds.
However, there are some improvements to be desired:
- It would be nice to be able to handle low latency "best effort" requests such as completion with stale metadata. Most metadata changes are the addition or deletion of an import, and most type checking can still be performed (this was previously worked on in x/tools/gopls: save old metadata before invalidating #42266, though that approach had some problems). In gopls terminology, I think the right way to do this is to have loads create a new snapshot, and let some requests operate on the earlier snapshot.
- We don't necessarily need to load with
-deps
when we're just selectively reloading a file or package. We can theoretically instead load without-deps
, see which imports (if any) are missing, and then load those missing imports in a second pass. - In many cases we don't need to invalidate the reverse transitive closure recursively (again, with the common case being the addition or deletion of an import). I believe this is only necessary for x_test packages which may pick up an intermediate test variant. If we can convince ourselves of this, then maybe we only need to invalidate x_test packages in the reverse transitive closure...
- When the queue of packages to load become large, it's usually faster (even with the go command) to reload the entire workspace. We can have a threshold at which point we decide just to reload
<modulePath>/...
.
We should experiment with these improvements. I believe they should be of particular help for users of a bazel go/packages drivers, as bazel queries have significantly higher overhead than the go command.