Refactor `IPCompleter` Matcher API #13745

krassowski · 2022-09-05T06:47:45Z

Closes #12820 and closes #13735. Supersedes #13734.

Partially fixes #13155 by exposing full context to matchers (but not the hooks yet - that would be a breaking change; I can follow up with it).

This is a gentle, backward and forward-compatible rewrite of Matcher API. I attempted to design the API for performance and future extensibility, while retaining compatibility with existing code.

There is quite some legacy logic in this codebase, which I tried to carefully preserve and document by making any such behaviour explicit, but in a few places I encountered behaviour which I don't know how to tackle (no unit tests, nor comments - not sure if existing behaviour was intended or incidental due to accumulated code complexity). I would highly appreciate help on such cases, suggestions on how to handle deprecations (if any are needed) and general feedback on this PR.

User-facing changes

Completion type is returned in the provisional API for most matchers

Before	After

Dictionary completions are restricted to dictionary keys

By default completions in dictionary context will only provide dictionary key matches. This is user-configurable.

The heuristic for this can be improved in the future iterations.

Before	After

Configuration

New configuration options were added:

IPCompleter.suppress_competing_matchers allows to instruct which matchers should (or should not) short-circuit to return candidate completions ignoring other matchers
IPCompleter.disable_matchers enables selective disabling of matchers

Existing configuration options are now aliases:

IPCompleter.merge_completions = False is an alias for IPCompleter.suppress_competing_matchers = True

Matcher API

Briefly, the new Matcher is defined as:

MatcherAPIv1 = Callable[[str], list[str]]
MatcherAPIv2 = Callable[[CompletionContext], MatcherResult]

Matcher = Union[MatcherAPIv1, MatcherAPIv2]

where MatcherAPIv1 is the same public API as exposed via (undocumented) IPCompleter.custom_matchers attribute since v8.0, #12130), MatcherAPIv2 is the new API, and subsequent versions can be added in the future.

Matcher API v2

The MatcherResult of API v2 is a dictionary, as and resolves an old TODO comment:

ipython/IPython/core/completer.py

Lines 2142 to 2145 in 7f51a03

    
           # FIXME: we should extend our api to return a dict with completions for 
        
           # different types of objects.  The rlcomplete() method could then 
        
           # simply collapse the dict into a list for readline, but we'd have 
        
           # richer completion semantics in other environments.

class MatcherResult(TypedDict):
    #: list of candidate completions
    completions: Union[Sequence[SimpleCompletion], Sequence[_JediCompletionLike]]   # simplified

    #: suffix of the provided ``CompletionContext.token``, if not given defaults to full token.
    matched_fragment: NotRequired[str]

    #: whether to suppress results from all other matchers (True), some
    #: matchers (set of identifiers) or none (False); default is False.
    suppress: NotRequired[Union[bool, Set[str]]]

    #: identifiers of matchers which should NOT be suppressed
    do_not_suppress: NotRequired[Set[str]]

    #: are completions already ordered and should be left as-is? default is False.
    ordered: NotRequired[bool]

This will enable adding additional attributes to the dictionary without breaking compatibility in the future. It also makes the API simple for users, as they only need to provide {'completions': list[SimpleCompletion]}.

SimpleCompletion is a container for completion metadata allowing to add type and in future additional attributes. After profiling initialisation, different forms of attribute/item access and memory use I chose to use a custom class with slots over NamedTuple or TypedDict due to inherent performance advantage. This is consistent with Completion class which also uses __slots__.

class SimpleCompletion:
    __slots__ = ["text", "type"]

    text: str
    type: str

CompletionContext is intended to provide matchers with information they need to generate completion candidates:

class CompletionContext(NamedTuple):
    token: str
    full_text: str
    cursor_position: int
    cursor_line: int

    @cached_property
    def text_until_cursor(self) -> str:
        ...

    @cached_property
    def line_with_cursor(self) -> str:
        ...

Switching between APIs and Matcher configuration

A decorator was added enabling demarking API versions and configuring Matcher-level options. The decorator is optional for v1 API, but required to use v2 API

def completion_matcher(
    *, priority: float = None, identifier: str = None, api_version=1
):
    """Adds attributes describing the matcher.
    Parameters
    ----------
    priority : Optional[float]
        The priority of the matcher, determines the order of execution of matchers.
        Higher priority means that the matcher will be executed first. Defaults to 50.
    identifier : Optional[str]
        identifier of the matcher allowing users to modify the behaviour via traitlets,
        and also used to for debugging (will be passed as ``origin`` with the completions).
        Defaults to matcher function ``__qualname__``.
    api_version: Optional[int]
        version of the Matcher API used by this matcher.
        Currently supported values are 1 and 2.
        Defaults to 1.
    """

    def wrapper(func: Matcher):
        func.matcher_priority = priority
        func.matcher_identifier = identifier or func.__qualname__
        func.matcher_api_version = api_version
        return func

    return wrapper

Maintaining compatibility

Existing methods such as dict_key_matches were not touched; instead wrapper methods converting from API v1 to API v2 and adding desired metadata (when applicable) were added; those methods have a new suffix _matcher. This approach allows users who monkey-patch matcher v1 methods in IPCompleter to continue using their existing code without any changes.

Follow-up tasks

To be discussed in separate issues and addressed in separate PRs:

benchmark type as a string vs as an enum to inform whether it is worth switching to enum
investigate how to integrate relevance score of sort (in sortText in LSP terms)

the lowest supported Python version catches up.

krassowski · 2022-09-05T07:26:06Z

Is there an automated way to update doctest with ipdoctest, or do I copy line by line? I added new configuration with verbose explanation and it just happens that In [2]: %config IPCompleter is used as an example in IPython/core/magics/config.py.

Carreau · 2022-09-06T16:40:45Z

Is there an automated way to update doctest with ipdoctest, or do I copy line by line? I added new configuration with verbose explanation and it just happens that In [2]: %config IPCompleter is used as an example in IPython/core/magics/config.py.

No there is no automatic ways. I copy and past also with light editting.

joelostblom · 2022-09-07T18:37:17Z

Awesome! The key-only completion for dictionaries will be super convenient!

krassowski · 2022-09-07T21:35:47Z

Ok, I would say this is good for a review and hopefully merge.

I am happy to commit to maintaining this and fixing any problems that might emerge as a result of merging this (which I don't expect frankly since everything should be 100% backward compatible - but we know how it is with big projects).

Documentation-wise I hit a minor hiccup with documenting the TypedDict but I submitted a bugfix to sphinx autodoc which solves the issue sphinx-doc/sphinx#10806 and all members will show up once that is accepted and released.

IPython/core/completer.py

Carreau · 2022-09-16T12:18:27Z

IPython/core/completer.py

+        self.type = type
+
+    def __repr__(self):
+        return f"<SimpleCompletion text={self.text!r} type={self.type!r}>"


WE could almost use a dataclass.

I decided not to use a dataclass due to a performance overhead.

IPython/core/completer.py

Carreau · 2022-09-16T12:36:57Z

IPython/core/completer.py

+
+
+@sphinx_options(show_inherited_members=True, exclude_inherited_from=["dict"])
+class SimpleMatcherResult(_MatcherResultBase, TypedDict):


DO you need to inherit both if _MatcherResultBase is already TypedDict.
and do we want to TypedDict, or simply make a datalass with all the possible existing fields, even if it's slightly backward-incompatible ?

DO you need to inherit both if _MatcherResultBase is already TypedDict.

No, not at runtime, but yes if we want to have nice documentation (sphinx-doc/sphinx#10806); a quirk of how TypedDict it was implemented. I added a comment in the body on this.

and do we want to TypedDict, or simply make a datalass with all the possible existing fields, even if it's slightly backward-incompatible ?

I hesitated here and considered making it a dataclass to, but I preferred TypedDict for two reasons:

in future releases we can add a new NotRequired field without breaking compatibility (and give notice if we want to make it required in the further future)

it does not require users to import any specific class from IPython to wrap the results (or create a custom class complying with the API), so it would keep it simple (while argument name and type safety is provided by typecheckers)

IPython/utils/docs.py

docs/sphinxext/apigen.py

Carreau · 2022-09-16T13:00:20Z

Thanks, I tried to do a quick pass, and will try to get a deeper look when on my way to CZI meeting next week.

In general I think you can be a little bit less careful on backward compatibility, and if you find really old things try to remove them if unnecessary, we only have a few dependees that use old APIs.

Co-authored-by: Matthias Bussonnier <bussonniermatthias@gmail.com>

Data class has narrower API than named tuple (does not expose item getters) which means it is less likely that the downstream code would be broken in the future if we introduce new attributes. Also, compared to other places where memory-efficient named tuple is used, `CompletionContext` is only created once per completion and thus does not require as low overhead. For the same reason we can forgo slightly more memory-efficient `@property @cache` stack and just use `@cached_property`.

Carreau · 2022-10-05T10:35:40Z

Ok, let's just try this and refine if needs there is.

jdtsmith · 2022-11-12T02:25:39Z

Playing with the new completer in an Emacs mode I'm building. So far so good; great work. Seems faster as well.

I had formerly overridden the matchers property to remove file and magic matches in certain contexts. disable_matchers is obviously the new way to do that. But is the matchers property no longer consulted for the list of enabled matchers? If so, its value is probably misleading.

krassowski · 2022-11-12T16:54:52Z

Thank you for the feedback @jdtsmith! Would you mind opening an issue with a minimal reproducible example? I don't see what could be the issue as:

IPCompleter.matchers was and remained a dynamically generated list defined as a property (the behaviour here did not change)
Completer.custom_matchers are still prepended to the matchers list (above) and used extensively in tests, so I don't believe there is a regression here
Thanks!

And yes, this PR lays a foundation for improved performance (see #13752 for an example).

krassowski mentioned this pull request Sep 5, 2022

Add completion types for completions obtained with matchers #13734

Closed

Refactor IPCompleter Matcher API

d22310b

krassowski force-pushed the completion-matcher branch from 6089879 to d22310b Compare September 5, 2022 06:50

krassowski added 2 commits September 5, 2022 08:00

Shim TypedDict and NotRequired at runtime until

728bad9

the lowest supported Python version catches up.

Update ipdoctest test

93c8b4d

krassowski mentioned this pull request Sep 6, 2022

Sort parameter completions first and in the signature order #13673

Open

krassowski added 3 commits September 7, 2022 03:05

Implement priority, do_not_suppress, add tests and docs.

cce8529

Correct suppression defaults, add a test for ipython#13735

21f1467

Merge branch 'main' into completion-matcher

b0daec1

krassowski marked this pull request as ready for review September 7, 2022 03:21

Improve type hinting and documentation

d137c7a

lumberbot-app bot added the tab-completion label Sep 7, 2022

krassowski added 2 commits September 7, 2022 22:16

Move typing_extensions optional dependency

b61b12e

highlight → code-block

56b6489

krassowski added 3 commits September 8, 2022 17:13

Fix backslash combining matchers (they require text_until_cursor).

37590bd

Merge branch 'main' into completion-matcher

88889c1

Relax constraint on limit to allow no limit

62af0b0

krassowski commented Sep 8, 2022

View reviewed changes

IPython/core/completer.py Show resolved Hide resolved

This was referenced Sep 8, 2022

Performance of completer: forward unicode matcher #13752

Open

Union trait does not parse correctly from strings ipython/traitlets#772

Closed

Autocompletions randomly stopping completely (or very slow) jupyter-lsp/jupyterlab-lsp#566

Open

Carreau self-assigned this Sep 14, 2022

Carreau reviewed Sep 16, 2022

View reviewed changes

IPython/core/completer.py Show resolved Hide resolved

Carreau reviewed Sep 16, 2022

View reviewed changes

IPython/utils/docs.py Outdated Show resolved Hide resolved

Carreau reviewed Sep 16, 2022

View reviewed changes

docs/sphinxext/apigen.py Show resolved Hide resolved

krassowski and others added 2 commits September 24, 2022 21:15

Remove outdated header as suggested

5bb0259

Co-authored-by: Matthias Bussonnier <bussonniermatthias@gmail.com>

Carreau merged commit dc08a33 into ipython:main Oct 5, 2022

Carreau added this to the 8.6 milestone Oct 19, 2022

krassowski mentioned this pull request Dec 11, 2022

(Quasi-)incorrect keyword argument completion #10013

Open

krassowski mentioned this pull request Feb 12, 2023

Autocomplete repeats content already in the cell. jupyter/notebook#6709

Closed

krassowski mentioned this pull request Apr 12, 2023

Tab-complete behavior when attaching objects to a module #14008

Open

joelostblom mentioned this pull request Oct 2, 2023

Include autocompletion for column names vega/altair#3213

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor `IPCompleter` Matcher API #13745

Refactor `IPCompleter` Matcher API #13745

krassowski commented Sep 5, 2022 •

edited

krassowski commented Sep 5, 2022

Carreau commented Sep 6, 2022

joelostblom commented Sep 7, 2022

krassowski commented Sep 7, 2022

Carreau Sep 16, 2022

krassowski Sep 23, 2022

Carreau Sep 16, 2022 •

edited

krassowski Sep 23, 2022

Carreau commented Sep 16, 2022

Carreau commented Oct 5, 2022

jdtsmith commented Nov 12, 2022

krassowski commented Nov 12, 2022

	# FIXME: we should extend our api to return a dict with completions for
	# different types of objects. The rlcomplete() method could then
	# simply collapse the dict into a list for readline, but we'd have
	# richer completion semantics in other environments.



		@sphinx_options(show_inherited_members=True, exclude_inherited_from=["dict"])
		class SimpleMatcherResult(_MatcherResultBase, TypedDict):

Refactor IPCompleter Matcher API #13745

Refactor IPCompleter Matcher API #13745

Conversation

krassowski commented Sep 5, 2022 • edited

User-facing changes

Completion type is returned in the provisional API for most matchers

Dictionary completions are restricted to dictionary keys

Configuration

Matcher API

Matcher API v2

Switching between APIs and Matcher configuration

Maintaining compatibility

Follow-up tasks

krassowski commented Sep 5, 2022

Carreau commented Sep 6, 2022

joelostblom commented Sep 7, 2022

krassowski commented Sep 7, 2022

Carreau Sep 16, 2022

Choose a reason for hiding this comment

krassowski Sep 23, 2022

Choose a reason for hiding this comment

Carreau Sep 16, 2022 • edited

Choose a reason for hiding this comment

krassowski Sep 23, 2022

Choose a reason for hiding this comment

Carreau commented Sep 16, 2022

Carreau commented Oct 5, 2022

jdtsmith commented Nov 12, 2022

krassowski commented Nov 12, 2022

Refactor `IPCompleter` Matcher API #13745

Refactor `IPCompleter` Matcher API #13745

krassowski commented Sep 5, 2022 •

edited

Carreau Sep 16, 2022 •

edited