gh-110309: prune empty constant in format specs #110320

sunmy2019 · 2023-10-03T23:21:37Z

close #110309

Issue: empty string constant in f-string format_spec #110309

bedevere-bot · 2023-10-03T23:26:32Z

🤖 New build scheduled with the buildbot fleet by @sunmy2019 for commit 19d1301 🤖

If you want to schedule another build, you need to add the 🔨 test-with-buildbots label again.

Parser/action_helpers.c

pablogsal · 2023-10-03T23:33:36Z

Thanks a lot for the PR @sunmy2019 !

bedevere-bot · 2023-10-03T23:40:40Z

🤖 New build scheduled with the buildbot fleet by @sunmy2019 for commit 60b4666 🤖

If you want to schedule another build, you need to add the 🔨 test-with-buildbots label again.

Parser/action_helpers.c

bedevere-bot · 2023-10-04T01:08:42Z

🤖 New build scheduled with the buildbot fleet by @sunmy2019 for commit 19c68e2 🤖

If you want to schedule another build, you need to add the 🔨 test-with-buildbots label again.

lysnikolaou · 2023-10-04T10:45:22Z

I'm wondering why we cannot do this in the tokenizer directly. Wouldn't something like this work?

cpython on  main [!] via C v15.0.0-clang via 🐍 pyenv 3.11.3 (venv) took 8s 
❯ git diff
diff --git a/Parser/tokenizer.c b/Parser/tokenizer.c
index 41d0d16a47..504bc9bed9 100644
--- a/Parser/tokenizer.c
+++ b/Parser/tokenizer.c
@@ -2639,6 +2639,12 @@ tok_get_fstring_mode(struct tok_state *tok, tokenizer_mode* current_tok, struct
     tok->first_lineno = tok->lineno;
     tok->starting_col_offset = tok->col_offset;
 
+    int in_format_spec = (
+            current_tok->last_expr_end != -1
+            &&
+            INSIDE_FSTRING_EXPR(current_tok)
+    );
+
     // If we start with a bracket, we defer to the normal mode as there is nothing for us to tokenize
     // before it.
     int start_char = tok_nextc(tok);
@@ -2655,6 +2661,10 @@ tok_get_fstring_mode(struct tok_state *tok, tokenizer_mode* current_tok, struct
             return tok_get_normal_mode(tok, current_tok, token);
         }
     }
+    else if (start_char == '}' && in_format_spec) {
+        tok_backup(tok, start_char);
+        return tok_get_normal_mode(tok, current_tok, token);
+    }
     else {
         tok_backup(tok, start_char);
     }
@@ -2726,11 +2736,6 @@ tok_get_fstring_mode(struct tok_state *tok, tokenizer_mode* current_tok, struct
             end_quote_size = 0;
         }
 
-        int in_format_spec = (
-                current_tok->last_expr_end != -1
-                &&
-                INSIDE_FSTRING_EXPR(current_tok)
-        );
         if (c == '{') {
             int peek = tok_nextc(tok);
             if (peek != '{' || in_format_spec) {

Didn't thoroughly test it, but one pass through the tests only shows one failure that's got to do with the error message when we have a lambda in the expression part.

sunmy2019 · 2023-10-04T12:11:06Z

I'm wondering why we cannot do this in the tokenizer directly.

I have no objection to this.

One thing to note: the same special handling of empty str constants is at the other parts of the code.

If your idea is adopted, we'd better clean up those special handling.

sunmy2019 · 2023-10-04T12:46:32Z

I'd propose we land this first and backport to 3.12.

Then maybe we can optimize further on the 3.13 branch.

lysnikolaou · 2023-10-05T08:36:18Z

I'd propose we land this first and backport to 3.12.

Then maybe we can optimize further on the 3.13 branch.

Agreed. Let's land this and then iterate on it in main.

lysnikolaou

Looks good!

pablogsal · 2023-10-05T13:42:57Z

I think we need to regenerate some files (run make regen-all)

prune empty constant in format specs

59c372a

sunmy2019 requested review from pablogsal and lysnikolaou as code owners October 3, 2023 23:21

bedevere-app bot added the awaiting review label Oct 3, 2023

bedevere-app bot mentioned this pull request Oct 3, 2023

empty string constant in f-string format_spec #110309

Closed

📜🤖 Added by blurb_it.

19d1301

sunmy2019 added the 🔨 test-with-buildbots Test PR w/ buildbots; report in status section label Oct 3, 2023

bedevere-bot removed the 🔨 test-with-buildbots Test PR w/ buildbots; report in status section label Oct 3, 2023

pablogsal reviewed Oct 3, 2023

View reviewed changes

Parser/action_helpers.c Show resolved Hide resolved

count non empty nodes first

60b4666

sunmy2019 added the 🔨 test-with-buildbots Test PR w/ buildbots; report in status section label Oct 3, 2023

bedevere-bot removed the 🔨 test-with-buildbots Test PR w/ buildbots; report in status section label Oct 3, 2023

pablogsal reviewed Oct 4, 2023

View reviewed changes

Parser/action_helpers.c Outdated Show resolved Hide resolved

pablogsal reviewed Oct 4, 2023

View reviewed changes

Parser/action_helpers.c Outdated Show resolved Hide resolved

simplify things

19c68e2

sunmy2019 added the 🔨 test-with-buildbots Test PR w/ buildbots; report in status section label Oct 4, 2023

bedevere-bot removed the 🔨 test-with-buildbots Test PR w/ buildbots; report in status section label Oct 4, 2023

lysnikolaou approved these changes Oct 5, 2023

View reviewed changes

bedevere-app bot added awaiting core review and removed awaiting review labels Oct 5, 2023

pablogsal enabled auto-merge (squash) October 5, 2023 13:31

pablogsal merged commit 2cb62c6 into python:main Oct 5, 2023
101 of 106 checks passed

bedevere-app bot removed the awaiting core review label Oct 5, 2023

sunmy2019 deleted the gh-110309 branch October 5, 2023 15:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gh-110309: prune empty constant in format specs #110320

gh-110309: prune empty constant in format specs #110320

sunmy2019 commented Oct 3, 2023 •

edited by bedevere-app bot

bedevere-bot commented Oct 3, 2023

pablogsal commented Oct 3, 2023

bedevere-bot commented Oct 3, 2023

bedevere-bot commented Oct 4, 2023

lysnikolaou commented Oct 4, 2023

sunmy2019 commented Oct 4, 2023 •

edited

sunmy2019 commented Oct 4, 2023

lysnikolaou commented Oct 5, 2023

lysnikolaou left a comment

pablogsal commented Oct 5, 2023

gh-110309: prune empty constant in format specs #110320

gh-110309: prune empty constant in format specs #110320

Conversation

sunmy2019 commented Oct 3, 2023 • edited by bedevere-app bot

bedevere-bot commented Oct 3, 2023

pablogsal commented Oct 3, 2023

bedevere-bot commented Oct 3, 2023

bedevere-bot commented Oct 4, 2023

lysnikolaou commented Oct 4, 2023

sunmy2019 commented Oct 4, 2023 • edited

sunmy2019 commented Oct 4, 2023

lysnikolaou commented Oct 5, 2023

lysnikolaou left a comment

Choose a reason for hiding this comment

pablogsal commented Oct 5, 2023

sunmy2019 commented Oct 3, 2023 •

edited by bedevere-app bot

sunmy2019 commented Oct 4, 2023 •

edited