REPLCompletions: replace `get_type` by the proper inference #49206

aviatesk · 2023-03-31T16:01:12Z

This PR generalizes the idea from #49199 and uses inference to analyze
the types of REPL expression. This approach offers several advantages
over the current get_[value|type]-based implementation:

The need for various special cases is eliminated, as lowering normalizes
expressions, and inference handles all language features.
Constant propagation allows us to obtain accurate completions for complex
expressions safely (see Use const propagating inference in REPL tab completions #36437).

Analysis on arbitrary REPL expressions can be done by the following steps:

Lower a given expression
Form a top-level MethodInstance from the lowered expression
Run inference on the top-level MethodInstance

This PR implements REPLInterpreter, a custom AbstractInterpreter that:

aggressively resolves global bindings to enable reasonable completions
for lines like Mod.a.| (where | is the cursor position)
aggressively concrete-evaluates :inconsistent calls to provide
reasonable completions for cases like Ref(Some(42))[].|
does not optimize the inferred code, as REPLInterpreter is only used
to obtain the type or constant information of given expressions

Aggressive binding resolution presents challenges for REPLInterpreter's
cache validation (since #40399 hasn't been resolved yet). To avoid cache
validation issue, REPLInterpreter only allows aggressive binding
resolution for top-level frame representing REPL input code
(repl_frame) and for child getproperty frames that are
constant propagated from the repl_frame. This works, since
1.) these frames are never cached, and
2.) their results are only observed by the non-cached repl_frame

REPLInterpreter also aggressively concrete-evaluates :inconsistent
calls within repl_frame, allowing it to get get accurate type
information about complex expressions that otherwise can not be constant
folded, in a safe way, i.e. it still doesn't evaluate effectful
expressions like pop!(xs). Similarly to the aggressive binding
resolution, this aggressive concrete evaluation doesn't present any cache
validation issues because it is limited to repl_frame that is never cached.

Also note that the code cache for REPLInterpreter is separated from the
native code cache, ensuring that code caches produced by REPLInterpreter,
where bindings are aggressively resolved and the code is not really
optimized, do not affect the native code execution. A hack has
also been added to avoid serializing CodeInstancess produced by
REPLInterpreter during precompilation to workaround #48453.

closes #36437
replaces #49199

aviatesk · 2023-03-31T16:11:16Z

@nanosoldier runbenchmarks(!"scalar", vs=":master")

oscardssmith · 2023-03-31T16:51:19Z

Can you make it so this is able to concrete eval through non-consistent code?

aviatesk · 2023-03-31T16:58:57Z

It might be reasonable to enable concrete eval for inconsistent calls limitedly, but we first need to finish #47154 in order to constant fold arrays.

stdlib/REPL/test/replcompletions.jl

staticfloat · 2023-03-31T17:16:05Z

as a fun little bonus, this approach is actually more correct when analyzing the degenerate case:

struct UnstableFoo
end
function Base.propertynames(::UnstableFoo)
    if rand() > 0.5
        return (:a, :b)
    else
        return (:a, :b, :c)
    end
end

struct Wrap
    a
end

julia> Wrap(UnstableFoo()).a.
a  b  c
julia> Wrap(UnstableFoo()).a.
a  b

staticfloat · 2023-03-31T17:37:45Z

Hmmm, playing around with this a bit, I did notice that unusual symbol names (such as those that might require use of var"" to access them) can confuse the tab autocompletion, since the var"" syntax looks like a string, and so it starts to auto-suggest filenames, rather than symbol names:

julia> struct WeirdNames
       end
       Base.propertynames(::WeirdNames) = (Symbol("oh no!"), Symbol("oh yes!"))

julia> WeirdNames().var"oh 
.buildkite-external-version  .clang-format                .clangd                      .codecov.yml                 .devcontainer/               .git-blame-ignore-revs
.git/                        .gitattributes               .github/                     .gitignore                   .mailmap                     CITATION.bib
CITATION.cff                 CONTRIBUTING.md              HISTORY.md                   LICENSE.md                   Make.inc                     Makefile
NEWS.md                      README.md                    THIRDPARTY.md                VERSION                      base/                        cli/
contrib/                     deps/                        doc/                         etc/                         julia                        julia.spdx.json
pkgimage.mk                  src/                         stdlib/                      sysimage.mk                  test/                        usr-staging/
usr/

nanosoldier · 2023-03-31T22:07:49Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here.

aviatesk · 2023-04-01T07:20:04Z

I did notice that unusual symbol names (such as those that might require use of var"" to access them) can confuse the tab autocompletion, since the var"" syntax looks like a string, and so it starts to auto-suggest filenames, rather than symbol names:

Yes, you're right. This specific issue isn't related to this PR since it comes from our handling of REPL input code, which occurs before the type analysis step. We should definitely improve it, likely in a separate PR.

aviatesk · 2023-04-01T07:21:08Z

Can you make it so this is able to concrete eval through non-consistent code?

Implemented. Now we can get completions for cases like Ref(Some(42))[].|. Combined with #47154, we will be able to get completions for cases like Any[Some(42)].| also.

aviatesk · 2023-04-01T07:22:01Z

@nanosoldier runbenchmarks("linalg" || "inference" || "sparse", vs=":master")

nanosoldier · 2023-04-01T09:10:23Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here.

This PR generalizes the idea from #49199 and uses inference to analyze the types of REPL expression. This approach offers several advantages over the current `get_[value|type]`-based implementation: - The need for various special cases is eliminated, as lowering normalizes expressions, and inference handles all language features. - Constant propagation allows us to obtain accurate completions for complex expressions safely (see #36437). Analysis on arbitrary REPL expressions can be done by the following steps: - Lower a given expression - Form a top-level `MethodInstance` from the lowered expression - Run inference on the top-level `MethodInstance` This PR implements `REPLInterpreter`, a custom `AbstractInterpreter` that: - aggressively resolve global bindings to enable reasonable completions for lines like `Mod.a.|` (where `|` is the cursor position) - aggressively concrete evaluates `:inconsistent` calls to provide reasonable completions for cases like `Ref(Some(42))[].|` - does not optimize the inferred code, as `REPLInterpreter` is only used to obtain the type or constant information of given expressions Aggressive binding resolution presents challenges for `REPLInterpreter`'s cache validation (since #40399 hasn't been resolved yet). To avoid cache validation issue, `REPLInterpreter` only allows aggressive binding resolution for top-level frame representing REPL input code (`repl_frame`) and for child `getproperty` frames that are constant propagated from the `repl_frame`. This works, since 1.) these frames are never cached, and 2.) their results are only observed by the non-cached `repl_frame` `REPLInterpreter` also aggressively concrete evaluate `:inconsistent` calls within `repl_frame`, allowing it to get get accurate type information about complex expressions that otherwise can not be constant folded, in a safe way, i.e. it still doesn't evaluate effectful expressions like `pop!(xs)`. Similarly to the aggressive binding resolution, aggressive concrete evaluation doesn't present any cache validation issues because `repl_frame` is never cached. Also note that the code cache for `REPLInterpreter` is separated from the native code cache, ensuring that code caches produced by `REPLInterpreter`, where bindings are aggressively resolved and the code is not really optimized, do not affect the native code execution. A hack has also been added to avoid serializing `CodeInstances`s produced by `REPLInterpreter` during precompilation to workaround #48453. closes #36437 replaces #49199

StefanKarpinski · 2023-04-03T20:25:54Z

What a cool and practical use of our effect analysis!

…g#49206) This PR generalizes the idea from JuliaLang#49199 and uses inference to analyze the types of REPL expression. This approach offers several advantages over the current `get_[value|type]`-based implementation: - The need for various special cases is eliminated, as lowering normalizes expressions, and inference handles all language features. - Constant propagation allows us to obtain accurate completions for complex expressions safely (see JuliaLang#36437). Analysis on arbitrary REPL expressions can be done by the following steps: - Lower a given expression - Form a top-level `MethodInstance` from the lowered expression - Run inference on the top-level `MethodInstance` This PR implements `REPLInterpreter`, a custom `AbstractInterpreter` that: - aggressively resolve global bindings to enable reasonable completions for lines like `Mod.a.|` (where `|` is the cursor position) - aggressively concrete evaluates `:inconsistent` calls to provide reasonable completions for cases like `Ref(Some(42))[].|` - does not optimize the inferred code, as `REPLInterpreter` is only used to obtain the type or constant information of given expressions Aggressive binding resolution presents challenges for `REPLInterpreter`'s cache validation (since JuliaLang#40399 hasn't been resolved yet). To avoid cache validation issue, `REPLInterpreter` only allows aggressive binding resolution for top-level frame representing REPL input code (`repl_frame`) and for child `getproperty` frames that are constant propagated from the `repl_frame`. This works, since 1.) these frames are never cached, and 2.) their results are only observed by the non-cached `repl_frame` `REPLInterpreter` also aggressively concrete evaluate `:inconsistent` calls within `repl_frame`, allowing it to get get accurate type information about complex expressions that otherwise can not be constant folded, in a safe way, i.e. it still doesn't evaluate effectful expressions like `pop!(xs)`. Similarly to the aggressive binding resolution, aggressive concrete evaluation doesn't present any cache validation issues because `repl_frame` is never cached. Also note that the code cache for `REPLInterpreter` is separated from the native code cache, ensuring that code caches produced by `REPLInterpreter`, where bindings are aggressively resolved and the code is not really optimized, do not affect the native code execution. A hack has also been added to avoid serializing `CodeInstances`s produced by `REPLInterpreter` during precompilation to workaround JuliaLang#48453. closes JuliaLang#36437 replaces JuliaLang#49199

This updates the code taken from `REPL.REPLCompletions` with the changes introduced in JuliaLang/julia#49206. Otherwise, `using FuzzyCompletions` will complain about `get_type` and `get_value` being undefined when precompiling and completely fail when being used on the upcoming Julia 1.10 release. Co-authored-by: Shuhei Kadowaki <aviatesk@gmail.com>

Adjusts to JuliaLang/julia#49206.

aviatesk requested review from staticfloat and vtjnash March 31, 2023 16:01

aviatesk mentioned this pull request Mar 31, 2023

Use Abstract Interpretation to search for overridden property names #49199

Closed

staticfloat assigned aviatesk Mar 31, 2023

staticfloat approved these changes Mar 31, 2023

View reviewed changes

stdlib/REPL/test/replcompletions.jl Outdated Show resolved Hide resolved

stdlib/REPL/test/replcompletions.jl Outdated Show resolved Hide resolved

aviatesk force-pushed the avi/36437 branch from 0da05f4 to e2e238b Compare April 1, 2023 07:16

aviatesk force-pushed the avi/36437 branch from e2e238b to d4240bf Compare April 2, 2023 01:24

aviatesk force-pushed the avi/36437 branch from d4240bf to e2932cf Compare April 2, 2023 05:28

staticfloat approved these changes Apr 3, 2023

View reviewed changes

oscardssmith merged commit 98988d8 into master Apr 3, 2023

oscardssmith deleted the avi/36437 branch April 3, 2023 20:29

oscardssmith added the stdlib:REPL Julia's REPL (Read Eval Print Loop) label Apr 3, 2023

Keno mentioned this pull request Apr 6, 2023

Tab completion doesn't work correctly for var"" symbols #49280

Closed

oscardssmith mentioned this pull request Apr 19, 2023

time to first tab complete #49415

Closed

fingolfin mentioned this pull request May 4, 2023

Warning on julia master oscar-system/GAP.jl#864

Closed

antoine-levitt mentioned this pull request May 5, 2023

Nested autocompletion JuliaPy/PyCall.jl#667

Open

Pangoraw mentioned this pull request Aug 7, 2023

Update complete_symbol with changes from https://github.com/JuliaLang/julia/pull/49206 JunoLab/FuzzyCompletions.jl#14

Merged

aviatesk added a commit to JunoLab/FuzzyCompletions.jl that referenced this pull request Aug 24, 2023

Merge pull request #14 from Pangoraw/update_for_1_10

f4b049e

Adjusts to JuliaLang/julia#49206.

oscardssmith mentioned this pull request Sep 5, 2023

Tab expansion of NamedTuple fails on 1.10.0-beta2 #51194

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

REPLCompletions: replace `get_type` by the proper inference #49206

REPLCompletions: replace `get_type` by the proper inference #49206

aviatesk commented Mar 31, 2023 •

edited

Loading

aviatesk commented Mar 31, 2023

oscardssmith commented Mar 31, 2023

aviatesk commented Mar 31, 2023 •

edited

Loading

staticfloat commented Mar 31, 2023

staticfloat commented Mar 31, 2023

nanosoldier commented Mar 31, 2023

aviatesk commented Apr 1, 2023

aviatesk commented Apr 1, 2023

aviatesk commented Apr 1, 2023

nanosoldier commented Apr 1, 2023

StefanKarpinski commented Apr 3, 2023

REPLCompletions: replace get_type by the proper inference #49206

REPLCompletions: replace get_type by the proper inference #49206

Conversation

aviatesk commented Mar 31, 2023 • edited Loading

aviatesk commented Mar 31, 2023

oscardssmith commented Mar 31, 2023

aviatesk commented Mar 31, 2023 • edited Loading

staticfloat commented Mar 31, 2023

staticfloat commented Mar 31, 2023

nanosoldier commented Mar 31, 2023

aviatesk commented Apr 1, 2023

aviatesk commented Apr 1, 2023

aviatesk commented Apr 1, 2023

nanosoldier commented Apr 1, 2023

StefanKarpinski commented Apr 3, 2023

REPLCompletions: replace `get_type` by the proper inference #49206

REPLCompletions: replace `get_type` by the proper inference #49206

aviatesk commented Mar 31, 2023 •

edited

Loading

aviatesk commented Mar 31, 2023 •

edited

Loading