Black cannot parse previously parseable file in 24.4.1 #4329

mrmundt · 2024-04-24T15:35:34Z

Describe the bug

The newest version of Black started failing in our CI: it reports that it cannot parse a line that earlier versions parsed just fine.

To Reproduce

Run Black on the file pyomo/contrib/pyros/util.py:

black pyomo/contrib/pyros/util.py

The resulting error is:

error: cannot format pyomo/contrib/pyros/util.py: Cannot parse: 2009:77: return f"{attr_val_str:{f'{self._ATTR_FORMAT_LENGTHS[attr_name]}'}}"

Expected behavior

Black formats the file instead of erroring out, as earlier versions did.

Environment

Black's version: 24.4.1
OS and Python version: macOS with Python 3.11; Ubuntu 22.04 with Python 3.10

JelleZijlstra · 2024-04-24T15:37:49Z

Thanks! cc @tusharsadhwani.

mrmundt · 2024-04-24T15:39:37Z

No no, thank YOU! We love your tool :) (Well, I love it. My team grumbles when they forget to run it and our linting job snarks at them.)

tusharsadhwani · 2024-04-24T15:46:46Z

This seems like the minimal reproduction:

f"{1:{f'{2}'}}"
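
As a sanity check that the reproducer is valid Python and not a borderline case, the sketch below parses and evaluates it with the standard library only. The name SOURCE is just an illustration; nothing here touches Black.

```python
import ast

# The minimal reproducer from the comment above, kept as source text.
SOURCE = '''f"{1:{f'{2}'}}"'''

# ast.parse raises SyntaxError if the interpreter rejects the expression.
tree = ast.parse(SOURCE, mode="eval")

# The inner f-string evaluates to "2", which becomes the format spec,
# so the whole expression is equivalent to format(1, "2").
value = eval(compile(tree, "<reproducer>", "eval"))
print(repr(value))
```

The nested f-string only supplies the field width, so the result is the number 1 padded to two characters.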

tusharsadhwani · 2024-04-24T15:50:41Z

Actually, using the same or different quotes gives us two different crash scenarios:

f'{1:{f'{2}'}}'

If the outer and inner f-strings use the same quotes, we get a different crash.

tarper24 · 2024-04-24T15:52:09Z

Using the same quotes is a syntax error in Python itself. You terminate the string early.

>>> f'{1:{f'{2}'}}'
  File "<stdin>", line 1
    f'{1:{f'{2}'}}'
            ^
SyntaxError: f-string: expecting '}'

tusharsadhwani · 2024-04-24T15:52:37Z

@tarper24 it works fine on Python 3.12 onwards.

JelleZijlstra · 2024-04-24T15:53:49Z

@tarper24 not in Python 3.12 any more. That's actually why we made this change; we had to revamp the parser around f-strings to support the new syntax. Unfortunately that caused us to start failing on some f-strings that were already valid. We found a few such cases before release by running Black on various codebases, but unfortunately we missed your case.
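
To see the version dependence discussed above on your own interpreter, here is a small sketch that tries to compile the same-quotes variant. The helper name compiles is made up for this example; it only wraps the built-in compile().

```python
import sys

# Same quote character for the outer and the inner f-string.
SAME_QUOTES = "f'{1:{f'{2}'}}'"


def compiles(source: str) -> bool:
    """Return True if the running interpreter accepts `source` as an expression."""
    try:
        compile(source, "<snippet>", "eval")
    except SyntaxError:
        return False
    return True


accepted = compiles(SAME_QUOTES)
print(sys.version_info[:2], "accepts same-quote nesting:", accepted)
```

On Python 3.12 and later this should report True; on earlier versions the inner quote ends the outer string early and compile() raises SyntaxError, which matches the traceback posted above.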

JelleZijlstra · 2024-04-25T05:11:22Z

I spent some time on this but couldn't figure out a solution yet.

The reproducer gets tokenized like this:

% python -m blib2to3.pgen2.tokenize 4329.py
1,0-1,2:	FSTRING_START	'f"'
1,2-1,2:	FSTRING_MIDDLE	''
1,2-1,3:	LBRACE	'{'
1,3-1,4:	NUMBER	'1'
1,4-1,5:	OP	':'
1,5-1,5:	FSTRING_MIDDLE	''
1,5-1,6:	OP	'{'
1,6-1,8:	FSTRING_START	"f'"
1,8-1,8:	FSTRING_MIDDLE	''
1,8-1,9:	LBRACE	'{'
1,9-1,10:	NUMBER	'2'
1,10-1,11:	OP	'}'
1,11-1,12:	FSTRING_MIDDLE	"'"
1,12-1,13:	RBRACE	'}'
Traceback (most recent call last):

The FSTRING_MIDDLE "'" near the end is wrong; it should be an FSTRING_END, closing the inner f-string.

My current thinking is that inside_fstring_colon in the tokenizer gets set to True for the outer f-string and is then applied incorrectly while we're parsing the inner f-string. To address that, I tried turning inside_fstring_colon into a stack with an entry for each nested f-string, but so far that doesn't work.
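
The idea of keeping per-f-string state on a stack can be illustrated with a toy scanner. This is not Black's tokenizer and it is not the fix; as noted above, the stack attempt did not work in the real code at this point. Everything here (the scan function, the context names, the event names) is hypothetical, and the scanner only handles a tiny subset: single-character quotes, no escapes, no doubled braces, no brackets or colons inside expressions.

```python
def scan(src):
    """Toy scanner for a restricted f-string subset; returns (kind, text) events."""
    # One entry per open context: ("fstring", quote), ("expr",) or ("spec",).
    # Because "spec" lives on the stack, an inner f-string never sees the
    # outer f-string's format-spec state.
    stack = []
    events = []
    i = 0
    while i < len(src):
        ch = src[i]
        top = stack[-1] if stack else ("expr",)
        if top[0] == "expr":
            if ch == "f" and src[i + 1 : i + 2] in ("'", '"'):
                # A new f-string starts: push its own context.
                stack.append(("fstring", src[i + 1]))
                events.append(("FSTRING_START", src[i : i + 2]))
                i += 2
                continue
            if ch == ":" and stack:
                # The replacement field switches to its format spec.
                stack[-1] = ("spec",)
                events.append(("COLON", ch))
            elif ch == "}" and stack:
                stack.pop()
                events.append(("RBRACE", ch))
            else:
                events.append(("EXPR", ch))
        elif top[0] == "fstring":
            if ch == top[1]:
                # Matching quote closes the innermost f-string.
                stack.pop()
                events.append(("FSTRING_END", ch))
            elif ch == "{":
                stack.append(("expr",))
                events.append(("LBRACE", ch))
            else:
                events.append(("MIDDLE", ch))
        else:  # format spec: literal text plus nested replacement fields
            if ch == "{":
                stack.append(("expr",))
                events.append(("LBRACE", ch))
            elif ch == "}":
                stack.pop()
                events.append(("RBRACE", ch))
            else:
                events.append(("MIDDLE", ch))
        i += 1
    return events


for kind, text in scan('''f"{1:{f'{2}'}}"'''):
    print(kind, repr(text))
```

In this toy model the quote after the inner replacement field comes out as FSTRING_END, because the innermost context at that point is the inner f-string, not the outer format spec.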

tusharsadhwani · 2024-04-25T05:22:13Z

Commenting out the "and bracelev == 0" condition in the part that yields RBRACE fixes this case, but it breaks other cases. That's as far as I got last night.

tusharsadhwani · 2024-04-25T05:23:04Z

Also it's not the FSTRING_MIDDLE that's incorrect, it's the OP just above it, which should be an RBRACE to match the LBRACE.
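
For comparison, the standard library's tokenize module can be run on the same reproducer. This is a sketch for reference only: it shows what CPython's tokenizer emits, which is not necessarily token-for-token what blib2to3 should emit. On Python 3.12 and later the f-string is split into FSTRING_START / FSTRING_END pairs; older versions return the whole f-string as one STRING token.

```python
import io
import sys
import tokenize

# The reproducer from this issue, followed by a newline.
SOURCE = '''f"{1:{f'{2}'}}"\n'''

tokens = list(tokenize.generate_tokens(io.StringIO(SOURCE).readline))
names = [tokenize.tok_name[tok.type] for tok in tokens]

for tok in tokens:
    # exact_type distinguishes individual operators such as LBRACE and RBRACE.
    print(
        tokenize.tok_name[tok.type],
        tokenize.tok_name[tok.exact_type],
        repr(tok.string),
    )
```

On 3.12+ each FSTRING_START should be paired with an FSTRING_END, which is the pairing the blib2to3 output above is missing for the inner f-string.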

tusharsadhwani · 2024-04-25T05:24:15Z

The minimised case that breaks when making the bracelev change is:

f'{1:{2}d}'
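
This case is ordinary, long-valid Python: a replacement field inside the format spec, followed by literal spec text. A quick check with plain Python (the variable names are only for illustration):

```python
# The nested field supplies the width; "d" is literal format-spec text after it.
value = f'{1:{2}d}'
print(repr(value))

# The same pattern with a named width, as it usually appears in real code.
width = 5
padded = f'{42:{width}d}'
print(repr(padded))
```

Both lines are equivalent to calling format() with the spec assembled first, for example format(1, "2d"), so any tokenizer change has to keep treating the "d" after the inner closing brace as part of the outer format spec.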

JelleZijlstra · 2024-04-25T05:37:04Z

What is the difference between OP and LBRACE/RBRACE here? I noticed the variation but it wasn't clear to me which one is correct.

tusharsadhwani · 2024-04-25T05:38:57Z

In the original implementation it's quite unclear which to use, but I went with yielding LBRACE whenever we switch from collecting FSTRING_MIDDLE tokens back to parsing Python expressions.
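
As background on the OP versus LBRACE/RBRACE question, here is how the standard library's tokenize module handles braces. This describes the stdlib, not blib2to3, so treat it only as an analogy: there, a brace has the generic type OP, and the specific operator is available as exact_type.

```python
import io
import tokenize

# Tokenize a tiny set display and keep only the brace tokens.
tokens = tokenize.generate_tokens(io.StringIO("{1}\n").readline)
pairs = [
    (tokenize.tok_name[tok.type], tokenize.tok_name[tok.exact_type])
    for tok in tokens
    if tok.string in ("{", "}")
]
print(pairs)
```

In the stdlib the two names describe the same token at different levels of detail, whereas in the output above the blib2to3 tokenizer yields them as distinct token kinds.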

JelleZijlstra · 2024-04-25T06:28:02Z

I got something that appears to work: #4332.

mrmundt added the "T: bug" label (Something isn't working) Apr 24, 2024

jsiirola mentioned this issue Apr 25, 2024

Skip black 24.4.1 due to a bug in the parser Pyomo/pyomo#3247

Merged

JelleZijlstra mentioned this issue Apr 25, 2024

Fix incorrect f-string tokenization #4332

Merged

user27182 mentioned this issue Apr 25, 2024

Cannot format f-strings where the contents re-use the same quotes as the enclosing f-string #4334

Closed

JelleZijlstra closed this as completed in #4332 Apr 25, 2024

tarper24 mentioned this issue Apr 26, 2024

Cannot parse multiline f-string containing multiline string #4337

Closed
