
Fix handling of invalid conversion characters #61

Conversation

isidentical (Collaborator)

No description provided.

@isidentical (Collaborator, Author)

We only have a single f-string conversion-character related error left after this PR, which is the following: "f'{3!ª}'". The problem is that it used to fail because the old f-string parser did not normalize identifiers (or rather, did not treat the character after the ! as an identifier). Now we treat it as an identifier, and by the time we check whether it is a/s/r we never see the real token's contents, since pegen has already decoded it for us and handed us the normalized version.

We could make them soft keywords (I think those aren't normalized, but I'm not super sure; @pablogsal or @lysnikolaou can probably correct me on this), OR we could simply allow this edge case (or hard-code it in the tokenizer, which might look a bit ugly).
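
For reference, this is the normalization at play, illustrated with the standard unicodedata module (CPython normalizes identifiers with NFKC per PEP 3131, and "ª" normalizes to a plain "a"):

>>> import unicodedata
>>> # NFKC is the normalization CPython applies to identifiers (PEP 3131);
>>> # once the character after "!" is tokenized as an identifier, "ª" becomes "a".
>>> unicodedata.normalize("NFKC", "ª")
'a'
>>> # A check that only sees the normalized text can no longer distinguish
>>> # "ª" from a valid conversion character:
>>> unicodedata.normalize("NFKC", "ª") in ("a", "s", "r")
True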

isidentical marked this pull request as ready for review on March 28, 2023 at 21:11
@pablogsal (Owner)

> We only have a single f-string conversion-character related error left after this PR, which is the following: "f'{3!ª}'". The problem is that it used to fail because the old f-string parser did not normalize identifiers (or rather, did not treat the character after the ! as an identifier). Now we treat it as an identifier, and by the time we check whether it is a/s/r we never see the real token's contents, since pegen has already decoded it for us and handed us the normalized version.
>
> We could make them soft keywords (I think those aren't normalized, but I'm not super sure; @pablogsal or @lysnikolaou can probably correct me on this), OR we could simply allow this edge case (or hard-code it in the tokenizer, which might look a bit ugly).

Let's simply allow the edge case; I don't want to add crazy complexity just for this particular case.
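
Concretely, allowing the edge case means the normalized identifier is accepted as the conversion, so on this branch "f'{3!ª}'" should behave like "f'{3!a}'" (a sketch of the expected behavior, not output from this exact build):

>>> f"{3!ª}"   # "ª" NFKC-normalizes to "a", so this acts like f"{3!a}"
'3'
>>> ascii(3)   # the !a conversion applies ascii()
'3'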

pablogsal merged commit 59ec33d into pablogsal:fstring-grammar-rebased-after-sprint on Mar 29, 2023
@pablogsal (Owner)

Hummm, actually this PR makes a lot more tests fail, so I think I am going to revert it for now.

@pablogsal (Owner)

30 failures -> 52 failures

@isidentical (Collaborator, Author)

Uh, in my local branch it was something like 28 to 24? (on test_fstring). Will double-check.

@isidentical (Collaborator, Author)

@pablogsal I can't seem to reproduce your findings 🤔 Is there a chance I'm missing something when trying this?

Before (current revision of the fstring-grammar-rebased-after-sprint branch):

❯ ./python -m test test_fstring
test test_fstring failed -- multiple errors occurred; run in verbose mode for details
test_fstring failed (30 failures)

== Tests result: FAILURE ==

1 test failed:
    test_fstring

Total duration: 863 ms
Tests result: FAILURE

After (actually before your revert / after this PR):

❯ git checkout HEAD^
HEAD is now at e1940e5e77 Fix test_type_comments
❯ [...]
❯ ./python -m test test_fstring
test test_fstring failed -- multiple errors occurred; run in verbose mode for details
test_fstring failed (24 failures)

== Tests result: FAILURE ==

1 test failed:
    test_fstring

Total duration: 820 ms
Tests result: FAILURE

isidentical added a commit to isidentical/cpython that referenced this pull request on Mar 30, 2023:
Note: this time we are allowing normalized Unicode identifiers as conversion characters, as per the discussion in pablogsal#61 (comment)
@pablogsal (Owner)

Hummmm, let me check in the main CI what happens if I revert your revert.

@pablogsal (Owner)

Does #62 include this PR as well?

@isidentical (Collaborator, Author)

Yep, with the above edge case fixed as well.
