Slow completion for existing object with long `__repr__` method #919

ElieGouzien · 2017-04-29T12:29:52Z

Hi,

When completing on existing objects, it seems that the __repr__() method is evaluated to build an error message inside the inspect module, and that error is then caught. This makes completion as slow as __repr__(), even though the evaluation is of no use to the user. This method can be rather slow with big pandas objects or custom classes.

This has been seen here:
qtconsole issue #90

Minimal code to reproduce it:

import jedi
class Bugger(object):
    def __init__(self, size=10000):
        """Create big data."""
        self.big_data = [list(range(i)) for i in range(size)]

    def easy_method(self):
        """Method that should be really fast."""
        return self.big_data[-1][-1]

    def __repr__(self):
        output = ""
        for nested in self.big_data:
            for elem in nested:
                output += str(elem)
            output += '\n'
        return output

test = Bugger()
jedi.Interpreter('test.ea', [locals()]).completions()

And a version that is more convenient to work with (small data, prints the call stack, and drops into pdb inside __repr__):

import jedi, pdb, inspect
class Bugger(object):
    def __init__(self, size=10):
        """Create big data."""
        self.big_data = [list(range(i)) for i in range(size)]

    def easy_method(self):
        """Method that should be really fast."""
        return self.big_data[-1][-1]

    def __repr__(self):
        frames = inspect.getouterframes(inspect.currentframe())
        for frame in frames:
            print(frame)
        pdb.set_trace()
        output = ""
        for nested in self.big_data:
            for elem in nested:
                output += str(elem)
            output += '\n'
        return output

test = Bugger()
jedi.Interpreter('test.ea', [locals()]).completions()

Jedi seems to reach __repr__ through jedi/evaluate/compiled/mixed.py, line 121.
Maybe adding a check before calling inspect.getsourcefile() could fix it, but I don't know Jedi well enough to be sure.


mangecoeur · 2017-05-04T14:58:00Z

I also experience this issue, through IPython 6.0. Note that the completion appears to be slower than a single repr call, suggesting that repr may be called many times.
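One way to check the "called many times" hunch is to count the calls directly. The sketch below is only an illustration: `ReprProbe` and `measure` are hypothetical names, not part of Jedi or IPython, and the measured action is a stand-in that formats the object twice so that the example runs without Jedi installed.

```python
import time


class ReprProbe(object):
    """Object whose __repr__ records how often it is called."""

    def __init__(self, delay=0.0):
        self.delay = delay        # artificial cost of one repr call, in seconds
        self.repr_calls = 0

    def easy_method(self):
        return 42

    def __repr__(self):
        self.repr_calls += 1
        time.sleep(self.delay)
        return "<ReprProbe>"


def measure(action, probe):
    """Run action(probe); return (number of repr calls, elapsed seconds)."""
    calls_before = probe.repr_calls
    start = time.perf_counter()
    action(probe)
    elapsed = time.perf_counter() - start
    return probe.repr_calls - calls_before, elapsed


# Stand-in action (an assumption for the demo): formats the object twice.
calls, elapsed = measure(lambda obj: "{!r} {!r}".format(obj, obj), ReprProbe())
print(calls, elapsed)
```

To measure the real case, the lambda could be swapped for the call from the report, for example `lambda obj: jedi.Interpreter('obj.ea', [{'obj': obj}]).completions()`, assuming a Jedi version that still provides `completions()`.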

davidhalter · 2017-05-04T16:36:24Z

For what it's worth, I'm pretty sure it's not the __repr__ that is slowing Jedi down. Jedi doesn't execute source code. Or let's say it tries to actively avoid it.

ElieGouzien · 2017-05-04T17:17:52Z

To clarify, this performance issue was found through IPython completion (since it uses Jedi). So it sounds reasonable to me that it works with "executed code", since I don't think IPython gives the full code history to Jedi, only the existing objects (but I don't know the internals of IPython or Jedi, so don't give too much credit to my guesses).

A typical example (from which I actually determined that Jedi is involved) is available here: qtconsole issue #90

davidhalter · 2017-05-04T17:38:01Z

@ElieGouzien It does obviously work with executed code. It just tries to not execute it. One of the issues you might be having is that Jedi tries to load the corresponding files of code (to improve autocompletion). This might be a lot of work (depending on the size of the library).

What would you guys generally consider slow (in seconds)?

mangecoeur · 2017-05-04T17:42:56Z

@davidhalter this seems to be specifically related to data objects rather than code files - particularly things like Pandas tables or large numpy arrays. It seems something related to jedi is doing something with the data object. It might be something to do with the way IPython uses Jedi? Perhaps some serialization going on?

ElieGouzien · 2017-05-04T17:52:35Z

@davidhalter In my case I had something like 10-30 s (with a custom class). Basically it takes as long as computing repr on the object.

What happens (I think) is that when inspect.getsourcefile fails, it computes repr to include it in its error message; but since Jedi catches the exception and finds another way to do the completion, that work is wasted. If that sounds good to you, I can try to write a check function that anticipates the failure of inspect.getsourcefile and simply doesn't call it in that case.
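The mechanism described above can be illustrated without Jedi. This is a minimal sketch under stated assumptions: `getfile_like` is a hypothetical stand-in that only imitates an error message embedding repr(obj); it is not the real inspect code, whose exact message may differ between Python versions. `CountingRepr` and `find_source` are also made-up names.

```python
class CountingRepr(object):
    """Counts repr calls so the hidden cost becomes visible."""
    repr_calls = 0

    def __repr__(self):
        CountingRepr.repr_calls += 1
        return "<CountingRepr>"


def getfile_like(obj):
    """Hypothetical stand-in for a lookup that fails on plain instances.

    It is NOT the real inspect code; it only imitates an error message
    that embeds repr(obj).
    """
    raise TypeError("{!r} has no source file".format(obj))


def find_source(obj):
    """Caller that swallows the error and falls back to None."""
    try:
        return getfile_like(obj)
    except TypeError:
        # The message is discarded, but repr(obj) was already computed.
        return None


result = find_source(CountingRepr())
print(result, CountingRepr.repr_calls)
```

Even though the caller never looks at the message, the counter shows that repr ran once while the exception was being built.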

ElieGouzien · 2017-05-06T11:49:07Z

OK, I have a patch!

@davidhalter Where should I put a test for that fix?

EDIT: I think I figured out where to put it, but I'm still not 100% sure I'm right. See pull request #922.

Anticipate the TypeError raised by inspect.getfile to prevent the computation of repr() for an error message which is never used. Useful for some big pandas arrays. Tentative fix for #919.

Was reported with issue #919.
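For readers who want the idea in code, here is a minimal sketch of such a pre-check. It is not the code from the pull request: `is_getfile_candidate` and `safe_getsourcefile` are hypothetical names, and the list of accepted kinds is taken from what inspect.getfile documents (module, class, method, function, traceback, frame, or code object).

```python
import inspect


def is_getfile_candidate(obj):
    """True if obj is one of the kinds inspect.getfile() accepts."""
    return (inspect.ismodule(obj) or inspect.isclass(obj)
            or inspect.ismethod(obj) or inspect.isfunction(obj)
            or inspect.istraceback(obj) or inspect.isframe(obj)
            or inspect.iscode(obj))


def safe_getsourcefile(obj):
    """Return the source file of obj, or None, without ever calling repr(obj)."""
    if not is_getfile_candidate(obj):
        # Plain instances (e.g. big data objects) are rejected up front,
        # so inspect never gets a chance to format them into an error.
        return None
    try:
        return inspect.getsourcefile(obj)
    except (TypeError, OSError):
        # Built-in modules/classes, or source that is not available.
        return None


class ForbiddenRepr(object):
    """Test double: any repr() call during the lookup is a failure."""

    def __repr__(self):
        raise AssertionError("repr() must not be called during the lookup")


result = safe_getsourcefile(ForbiddenRepr())
print(result)
```

Making __repr__ raise is a cheap way to test the property without needing a genuinely slow object: if the lookup returns normally, repr was never touched.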

takluyver · 2017-06-12T15:43:47Z

I've submitted a bug and a PR to Python to try to improve this in the inspect module:

http://bugs.python.org/issue30639
python/cpython#2132

davidhalter · 2017-06-13T11:53:14Z

Nice! Thanks!

ElieGouzien mentioned this issue Apr 29, 2017

Ipython qtconsole continues line instead of executing on return jupyter/qtconsole#90

Open

davidhalter added the performance label Apr 29, 2017

makmanalp mentioned this issue May 3, 2017

Slow tab-completion on large data object in notebook ipython/ipython#10493

Open

ElieGouzien mentioned this issue May 6, 2017

Fix #919 by preventing unecessary call to repr() from within inspect. #922

Merged

davidhalter pushed a commit that referenced this issue May 6, 2017

Test that no repr() can slow down completion.

9d5cc0b

Was reported with issue #919.

davidhalter closed this as completed May 6, 2017

davidhalter mentioned this issue Jun 14, 2017

Jedi hangs for a long time with DataFrame containing timestamps #931

Closed
