codeintel: Auto-inference sandbox API #33756

efritz · 2022-04-11T22:19:14Z

This PR adds an initial internal/codeintel/autoindexing/internal/inference package that encodes our auto-indexing inference rules in a Lua sandbox. The package is currently unused by any entrypoint (outside of unit tests); subsequent efforts (#33044 and #33045) will replace the original code with this package.
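For readers unfamiliar with inference, the following is a minimal sketch of the kind of rule such a package encodes: match well-known file names in a repository and emit one indexing job per match. It is written in Go rather than Lua, it is not the API added in this PR, and every name in it (IndexJob, Recognizer, Infer, the indexer labels) is hypothetical.

```go
package main

import (
	"fmt"
	"path"
	"sort"
)

// IndexJob is a hypothetical description of one inferred indexing job.
type IndexJob struct {
	Root    string // directory the indexer would run in
	Indexer string // placeholder label for the indexer to invoke
}

// Recognizer is a hypothetical rule: a file basename and the indexer it implies.
type Recognizer struct {
	Basename string
	Indexer  string
}

// Infer returns one job for every path whose basename matches a recognizer,
// sorted by root directory so the result is deterministic.
func Infer(paths []string, recognizers []Recognizer) []IndexJob {
	var jobs []IndexJob
	for _, p := range paths {
		for _, r := range recognizers {
			if path.Base(p) == r.Basename {
				jobs = append(jobs, IndexJob{Root: path.Dir(p), Indexer: r.Indexer})
			}
		}
	}
	sort.Slice(jobs, func(i, j int) bool { return jobs[i].Root < jobs[j].Root })
	return jobs
}

func main() {
	// Illustrative repository contents, not taken from the PR's test cases.
	paths := []string{"go.mod", "cmd/tool/go.mod", "web/tsconfig.json", "README.md"}
	recognizers := []Recognizer{
		{Basename: "go.mod", Indexer: "go-indexer"},
		{Basename: "tsconfig.json", Indexer: "typescript-indexer"},
	}
	for _, job := range Infer(paths, recognizers) {
		fmt.Printf("root=%q indexer=%s\n", job.Root, job.Indexer)
	}
}
```

Expressing rules like these as data plus a small matching function is what makes it plausible to load them at runtime instead of compiling them in.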

Closes #33041 and #33040.

This PR does not close discussion on API design. It roughs out what seems to be a very usable interface for us to extend. We're still waiting on customer feedback (see #33047) to help ensure we're solving the right expressibility problem. That doesn't seem to be a blocker for moving the PoC forward, but it may be a blocker before moving onto or past #33046.

Reviewer hints:

Please review by-commit, paying special attention to the following:

766e254 adds the entire service implementation
e1894e1 reimplements the logic of lib/codeintel/autoindex in Lua
7eac9d0 adds unit tests

Test plan

This PR adds substantial test coverage and shows that we have one-to-one behavior with our existing indexers (test cases are taken verbatim). Additional testing will be done before this code is hooked up to a production (or even development) application entrypoint.

mrnugget · 2022-04-20T08:34:45Z

High-level question: the PR description says this closes #33041 and #33040, which belong to milestones 1 and 4 respectively. This comment in the RFC makes it sound like we haven't finished milestone 0 yet.

So why do this now? Did we actually have to implement this to get some feedback from customers? Because if we haven't heard from customers yet on RFC 624 and whether it'll solve their problem, this is a lot of code to build on a hunch. What I'm thinking is: could we have written pseudo Lua code and asked them whether they'd use it and whether that would work for their codebase?

efritz · 2022-04-21T13:05:56Z

High-level response: what happened in the week while @mrnugget was OoO and some misc thoughts:

We've reached out to four customers this iteration to adopt auto-indexing. One is blocked on their internal Java team. One uses primarily Go, so we're all set for them. We've asked the other two specific questions about their repository setup and how they would index their code by hand. Both responded this morning, so we'll be following up to gather more useful information.
More people are setting up auto-indexing and hitting the "what now" part when it comes to getting it to configure jobs, which puts pressure on us to have some sort of inference-logic-as-configuration solution. The majority of RFC 624 is a direct solution to this (with some additional work on exporting it for users as a polished interface). I feel that M0 (API validation) is a blocker for making it a user-facing feature, but not for doing the technical work up front, which gives us the benefit of modifying a running instance without having to hard-code customer-specific rules in the next patch release.
This PR combines parts of M1 (Lua port of lib/codeintel/autoindexing/inference) and M4 (update/optimize gitserver communication), as it turns out it would have been more total work to do a direct port in M1 and then change it later for M4. @tjdevries gave a lot of insights to help inform these decisions as well.

What we should do going forward:

Land M1 and M4 (this PR)
Add additional rate limiting and observation around gitserver calls from Lua (M2: harden gitserver client). We've made requests more efficient across all recognizers, but we're still changing the exact set of things we run, so we can't be too careful.
Add a backdoor way to insert configuration into an instance so that the next release will have some mid-release way to say "Ooh you want support for pom.xml.rft? No problem!" and we can give them a string to insert directly into their database for now.
Later (not urgently) polish this and advertise it as a possibility for users (not an ace up our sleeve during experiments).
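To make the hardening item above concrete, here is a small sketch of one possible shape for it: a wrapper that counts calls (observation) and refuses further calls once a fixed budget is spent. This is a request budget, which is simpler than time-based rate limiting, and it is not the gitserver client in this repository. FileLister, Budgeted, and ErrBudgetExceeded are hypothetical names, and the wrapper is not safe for concurrent use as written.

```go
package main

import (
	"errors"
	"fmt"
)

// FileLister is a hypothetical stand-in for the call a recognizer would
// make to list files in a repository.
type FileLister func(repo string) ([]string, error)

// ErrBudgetExceeded is returned once the wrapper has used up its budget.
var ErrBudgetExceeded = errors.New("request budget exceeded")

// Budgeted wraps a FileLister, counting calls and rejecting any call
// made after max calls have already been forwarded.
type Budgeted struct {
	next  FileLister
	max   int
	Calls int // number of calls forwarded so far
}

// List forwards to the wrapped lister while budget remains.
func (b *Budgeted) List(repo string) ([]string, error) {
	if b.Calls >= b.max {
		return nil, ErrBudgetExceeded
	}
	b.Calls++
	return b.next(repo)
}

func main() {
	// A fake lister so the sketch runs without any real backend.
	fake := func(repo string) ([]string, error) { return []string{"go.mod"}, nil }
	client := &Budgeted{next: fake, max: 2}
	for i := 0; i < 3; i++ {
		_, err := client.List("example/repo")
		fmt.Println("call", i, "rejected:", errors.Is(err, ErrBudgetExceeded))
	}
}
```

A per-inference budget like this bounds the worst case for a badly written rule; a time-based limiter would additionally smooth load across concurrent inferences.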

I'm very confident that the solution here is not far off from the solution we'd get if we blocked on completion of M0. We have enough use cases we don't support today for Java alone (no build tooling support today, only package repos) that the API in this PR is perfectly suited for.

Completion of M0 is still super important because the Lua sandbox in this PR is not yet ergonomic. It's a bare-bones v1 (that improves on the existing art), but it's not idiomatic and has no standard library attached yet (I'll be deferring heavily to customer response and @tjdevries's feedback here).

olafurpg

LGTM 👍 I like the new Lua inference logic, and it will be a game changer for users to be able to roll out custom inference changes.

efritz · 2022-04-22T00:43:38Z

Enabling auto-merge now, but would be very happy to receive another review. We'll be iterating on this in the very near future.

cla-bot bot added the cla-signed label Apr 11, 2022

efritz self-assigned this Apr 11, 2022

efritz force-pushed the ef/playground branch 11 times, most recently from cf65be2 to 8aab4e2, on April 13, 2022 19:59

tjdevries reviewed Apr 13, 2022

internal/luasandbox/codeintel/lua/recognizers.lua

tjdevries reviewed Apr 13, 2022

internal/luasandbox/codeintel/lua/recognizers.lua

efritz force-pushed the ef/playground branch from 333f746 to 595e8dc on April 13, 2022 21:33

efritz mentioned this pull request Apr 13, 2022

luasandbox: Add additional utils #33880

Merged

efritz force-pushed the ef/playground branch 13 times, most recently from 3bca35d to b8b1575, on April 19, 2022 12:39

efritz requested a review from Strum355 April 19, 2022 20:34

efritz commented Apr 19, 2022

internal/codeintel/autoindexing/inference/lua/util.lua

efritz requested review from olafurpg and removed request for varungandhi-src April 21, 2022 13:56

olafurpg approved these changes Apr 21, 2022

internal/codeintel/autoindexing/inference/lua/util.lua

internal/codeintel/autoindexing/inference/service_test.go

tjdevries reviewed Apr 21, 2022

internal/codeintel/autoindexing/inference/libs.go

tjdevries reviewed Apr 21, 2022

internal/codeintel/autoindexing/inference/lua/go.lua

tjdevries reviewed Apr 21, 2022

internal/codeintel/autoindexing/inference/lua/go.lua

tjdevries reviewed Apr 21, 2022

internal/codeintel/autoindexing/inference/lua/typescript.lua

This was referenced Apr 21, 2022

codeintel: Auto-inference sandbox - Unify require and loadfile #34295

Closed

codeintel: Auto-inference sandbox - Type transformation #34296

Closed

codeintel: Auto-inference sandbox - Adopt an stdlib #34297

Closed

efritz added 7 commits April 21, 2022 19:04

Merge branch 'main' into ef/playground

62cbf72

Move into internal package.

504cba8

Separate test cases from test machinery.

082d762

Split tests by language.

5ecea13

Commit changed but unstaged changes from last commit.

fda27dd

Do the paths thing.

0faff06

Rename callback to be a bit less generic.

90eb1f7

efritz requested a review from tjdevries April 22, 2022 00:32

WIP.

3a30c9c

efritz enabled auto-merge (squash) April 22, 2022 00:43

efritz disabled auto-merge April 22, 2022 01:02

efritz enabled auto-merge (squash) April 22, 2022 01:02

efritz merged commit 273d79b into main Apr 22, 2022

efritz deleted the ef/playground branch April 22, 2022 01:07

efritz mentioned this pull request Apr 22, 2022

RFC 624: (M4) Design new recognizer interface #33040

Closed
