bpo-40599: Improve error messages with expected keywords #20039

pablogsal · 2020-05-11T19:32:39Z

The current errors are due to tests that don't expect the extra information in the error messages.

https://bugs.python.org/issue40599

pablogsal · 2020-05-11T19:34:18Z

I open this PR as a draft because I still have some questions about the approach but we can start a discussion to see how to improve it :)

Some examples of the errors with this PR:

>>> from x impirt l
  File "<stdin>", line 1
    from x impirt l
           ^
SyntaxError: Invalid syntax. Expected one of: import
>>> with x os y
  File "<stdin>", line 1
    with x os y
           ^
SyntaxError: Invalid syntax. Expected one of: as, or, if, in, is, not, and
>>> l = 3 if x elso 4
  File "<stdin>", line 1
    l = 3 if x elso 4
               ^
SyntaxError: Invalid syntax. Expected one of: or, in, is, else, not, and
>>> if x:
...   ...
... elif y:
...   ...
... elso
  File "<stdin>", line 5
    elso
    ^
SyntaxError: Invalid syntax. Expected one of: elif, else

With some work, we could extend this to general tokens (although it may be very verbose).

gvanrossum · 2020-05-11T21:51:15Z

This looks like scope creep for the original issue. Even if you can solve this using the new parser, please open a new bpo issue and have a discussion about it there (or perhaps on python-ideas or python-dev?).

pablogsal · 2020-05-11T22:08:33Z

This looks like scope creep for the original issue. Even if you can solve this using the new parser, please open a new bpo issue and have a discussion about it there (or perhaps on python-ideas or python-dev?).

Ok, will do. In any case, I opened the draft PR to get some initial opinions from you and @lysnikolaou to see if is worth pursuing.

pablogsal · 2020-05-11T22:12:05Z

I have linked this against this new issue:

https://bugs.python.org/issue40599

and unlinked it from the original issue.

aeros

I have a bit of feedback on some of the examples. When/if I find the time, I'll experiment further with the PR locally and try to provide more in-depth feedback.

>>> from x impirt l
  File "<stdin>", line 1
    from x impirt l
           ^
SyntaxError: Invalid syntax. Expected one of: import

Huge +1 for offering a specific suggestion when it's unambiguous. However, I think that it could look a significantly better to omit the "one of" when there is only when possible suggestion (assuming it's not too much of an implementation issue). E.g. SyntaxError: Invalid syntax. Expected: import.

>>> with x os y
  File "<stdin>", line 1
    with x os y
           ^
SyntaxError: Invalid syntax. Expected one of: as, or, if, in, is, not, and

I'm -0 on the behavior in the above example. Offering 5+ possible suggestions (7 in this case) for keywords doesn't seem especially helpful and results in a rather long exception message.

My personal preference would be to only display the possible suggestion(s) if there are only 4 or less (maybe 3?) possible keywords, and omit "one of" when there's only one.

As for the implementation details, my knowledge of the C-API and especially the parser is rather limited, but would it be possible to check the number of possible keywords with something like Py_ssize_t err_tokens_size = PySet_Size(p->err_tokens)?

If the above or something along those lines would be feasible, I would be +1 on the PR overall. Otherwise, I'm +0. It seems like a good idea, but I think the error message would be a bit too bloated when there's too many possible keywords.

pablogsal · 2020-05-11T23:27:26Z

Thanks for the feedback @aeros !

I think that the problem is that, as Guido mentions, without tokens this is not very helpful and once we add tokens the possible items in the set will increase a lot. For instance, if you do x y it will complain that it was expecting any operator and all possible operators will be displayed. The same stands for other tokens like ";" or ",". In those errors, is not possible for the parser to know what is the most "likely" one so restricting to only report with "n" tokens will only trigger this on very specific cases.

As I mentioned in the issue, I opened this draft to start a discussion but I and more and more unsure that this approach will give us better debugability than the noise it adds :(

aeros · 2020-05-12T03:07:43Z

In those errors, is not possible for the parser to know what is the most "likely" one so restricting to only report with "n" tokens will only trigger this on very specific cases.

I would personally be okay with that, it could be quite useful at occasionally detecting unambiguous misspellings, such as the import example. Although one may argue that it's more of a job of a linter to pick up on that.

As I mentioned in the issue, I opened this draft to start a discussion but I and more and more unsure that this approach will give us better debugability than the noise it adds :(

Either way, I think this discussion is a good start and gives a general direction (and perhaps more importantly, establishes what it shouldn't do). Thanks for starting the discussion.

the-knights-who-say-ni added the CLA signed label May 11, 2020

bedevere-bot added the awaiting core review label May 11, 2020

pablogsal requested review from lysnikolaou and gvanrossum May 11, 2020 19:34

pablogsal added the skip news label May 11, 2020

bpo-40334: Improve error messages with expected keywords

95880fe

pablogsal force-pushed the errors branch from 3392005 to 95880fe Compare May 11, 2020 19:40

pablogsal changed the title ~~bpo-40334: Improve error messages with expected keywords~~ bpo-40599: Improve error messages with expected keywords May 11, 2020

aeros reviewed May 11, 2020

View reviewed changes

pablogsal closed this May 11, 2020

pablogsal deleted the errors branch May 19, 2021 18:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

bpo-40599: Improve error messages with expected keywords #20039

bpo-40599: Improve error messages with expected keywords #20039

Uh oh!

pablogsal commented May 11, 2020 •

edited

Loading

Uh oh!

pablogsal commented May 11, 2020 •

edited

Loading

Uh oh!

gvanrossum commented May 11, 2020

Uh oh!

pablogsal commented May 11, 2020

Uh oh!

pablogsal commented May 11, 2020

Uh oh!

aeros left a comment •

edited

Loading

Uh oh!

pablogsal commented May 11, 2020 •

edited

Loading

Uh oh!

aeros commented May 12, 2020 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

bpo-40599: Improve error messages with expected keywords #20039

bpo-40599: Improve error messages with expected keywords #20039

Uh oh!

Conversation

pablogsal commented May 11, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pablogsal commented May 11, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gvanrossum commented May 11, 2020

Uh oh!

pablogsal commented May 11, 2020

Uh oh!

pablogsal commented May 11, 2020

Uh oh!

aeros left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

pablogsal commented May 11, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aeros commented May 12, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

pablogsal commented May 11, 2020 •

edited

Loading

pablogsal commented May 11, 2020 •

edited

Loading

aeros left a comment •

edited

Loading

pablogsal commented May 11, 2020 •

edited

Loading

aeros commented May 12, 2020 •

edited

Loading