Pyupgrade: Format specifiers #1594

colin99d · 2023-01-03T12:09:58Z

A part of #827. Posting this for visibility. There is still some work to be done.

Things that still need to be done before this is ready:

Does not work when the item is being assigned to a variable
Does not work if being used in a function call
Fix incorrectly removed calls in the function
Has not been tested with pyupgrade negative test cases

Tests from pyupgrade can be seen here: https://github.com/asottile/pyupgrade/blob/main/tests/features/format_literals_test.py

colin99d · 2023-01-04T12:47:41Z

I will try to have this done by Sunday night.

src/pyupgrade/plugins/format_specififiers.rs

colin99d · 2023-01-06T22:19:17Z

@charliermarsh I have the following function to detect expressions:

pub fn match_expression(expression_text: &str) -> Result<Expression> {
    match libcst_native::parse_expression(expression_text) {
        Ok(expression) => Ok(expression),
        Err(_) => bail!("Failed to extract CST from source"),
    }
}

It works fine for an expression like this:

    'foo{0}' 'bar{1}'.format(1, 2)

But the second you make it multi-line like so:

    'foo{0}'
        'bar{1}'.format(1, 2)

The whole thing fails. Is this an error with my coding, or an error with libcst_native?
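For anyone reading along, the same behavior can be reproduced with CPython's own `ast` module. This is only a sketch, and it assumes libcst_native follows the same grammar rule as CPython here: outside of brackets, a newline ends the expression, so the second string is no longer part of it. One plausible reading (not confirmed in this thread) is that the original source had the strings inside parentheses, and the enclosing parentheses were lost when the expression text was sliced out.

```python
import ast

SINGLE_LINE = "'foo{0}' 'bar{1}'.format(1, 2)"
MULTI_LINE = "'foo{0}'\n'bar{1}'.format(1, 2)"


def parses_as_expression(source: str) -> bool:
    """Return True if `source` is a single valid Python expression."""
    try:
        ast.parse(source, mode="eval")
    except SyntaxError:
        return False
    return True


print(parses_as_expression(SINGLE_LINE))        # True
print(parses_as_expression(MULTI_LINE))         # False
# Wrapping the same text in parentheses makes the newline insignificant.
print(parses_as_expression(f"({MULTI_LINE})"))  # True
```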

charliermarsh · 2023-01-06T22:39:46Z

I'm wondering if you need to dedent the code. You might be passing LibCST the indented expression, which would cause an error.

colin99d · 2023-01-06T22:50:42Z

It looks like the issue still happens with the de-indented code: 'foo{0}'\n'bar{1}'.format(1, 2).

colin99d · 2023-01-07T03:58:15Z

thread 'main' panicked at 'called Result::unwrap() on an Err value: ParserError(ParseError { location: ParseLoc { start_pos: LineCol { line: 2, column: 4, offset: 13 }, end_pos: LineCol { line: 2, column: 12, offset: 21 } }, expected: ExpectedSet { expected: {"EOF"} } }, "'foo{0}'\n 'bar{1}'.format(1, 2)")', src/pyupgrade/plugins/format_specififiers.rs:57:57

Looks like this is a parse error from the parse_expression function in libcst_native. I think we can either:

Make a custom patch in your fork
Find a way to modify the string to add a \ at the end of each line so the parser can understand it, and then remove it once we are done
Don't use libcst_native (I don't like this one because I like libcst now)

Just let me know which you would like for me to pursue.
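To make the second option concrete, here is a rough sketch of the backslash idea, again using CPython's `ast` as a stand-in parser (an assumption; nothing here calls libcst_native). The helper names `add_continuations` and `remove_continuations` are hypothetical. It would break on lines that end in a comment or sit inside a triple-quoted string, and the round trip would also rewrite any continuation that was already in the source.

```python
import ast


def add_continuations(source: str) -> str:
    """Join physical lines with a trailing backslash so they form one logical line."""
    return " \\\n".join(source.split("\n"))


def remove_continuations(source: str) -> str:
    """Undo add_continuations (also strips any ' \\' continuation already present)."""
    return source.replace(" \\\n", "\n")


original = "'foo{0}'\n    'bar{1}'.format(1, 2)"
patched = add_continuations(original)

# The patched text is now a single logical line and parses as one expression.
tree = ast.parse(patched, mode="eval")
print(type(tree.body).__name__)

# Stripping the continuations gives back the original text.
print(remove_continuations(patched) == original)
```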

charliermarsh · 2023-01-07T23:11:18Z

Would you be open to filing an issue on the LibCST repo? In the meantime, we could just skip those cases. (Are they needed for detection, or just autofixing?)

colin99d · 2023-01-08T23:54:18Z

Here is the issue:
Instagram/LibCST#846
I will just make it a check for now!

charliermarsh · 2023-01-09T01:04:59Z

src/pyupgrade/plugins/format_specififiers.rs

                            new_call,
                            expr.location,
                            expr.end_location.unwrap(),
                        ));
                    }
-                    checker.add_check(check);
+                    checker.diagnostics.push(diagnostic);


Thank you and sorry that this all churned mid-PR for you.

I'm never going to complain about you making this project easier to contribute to.

colin99d · 2023-01-09T03:07:57Z

Hey @charliermarsh, I got all the positive test cases working (except for the one edge case we talked about), and all the negative cases working except one. I wanted to get your feedback before I implement it.
The following edge case is what causes issues:

'{' '0}'.format(1)

Since the format specifier is split across two different strings, it is NOT supposed to be rewritten. However, both the Expr struct and the tokens automatically combine this into one string. What would be the best way of detecting that this edge case is present? Or do we want to ignore it, since the refactoring would produce changes that could be seen as desirable?
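The merging is easy to see with CPython's `ast` module. This is only an illustration using CPython's parser; the PR itself works with the RustPython AST, which the comment above says merges the pieces the same way.

```python
import ast

# Parse the edge case as a single expression and look at the receiver of .format().
tree = ast.parse("'{' '0}'.format(1)", mode="eval")
call = tree.body
receiver = call.func.value

# The two adjacent literals have already been merged into one constant.
print(type(receiver).__name__, repr(receiver.value))  # Constant '{0}'
```

By the time a rule sees the AST node, there is no trace that `{` and `0}` came from separate literals.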

charliermarsh · 2023-01-09T06:13:21Z

I think it's best just to ignore it (i.e., treat it as if there is no space). It strikes me as exceptionally rare, to the point that it's likely either a mistake or that it's not worth adding more complexity to our own implementation to handle it.

charliermarsh · 2023-01-09T06:13:34Z

We could probably detect it by tokenizing... but it doesn't seem worth it to me.
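As a sketch of what detection by tokenizing could look like, here is a version using Python's standard `tokenize` module. Ruff would use its own Rust tokenizer, so this only illustrates the idea; `has_implicit_concatenation` is a hypothetical helper name.

```python
import io
import tokenize


def has_implicit_concatenation(source: str) -> bool:
    """Return True if two string literals sit next to each other in `source`."""
    previous_was_string = False
    for tok in tokenize.generate_tokens(io.StringIO(source).readline):
        if tok.type == tokenize.STRING:
            if previous_was_string:
                return True
            previous_was_string = True
        elif tok.type not in (tokenize.NL, tokenize.COMMENT):
            # Any other token (operator, name, NEWLINE, ...) breaks the run.
            previous_was_string = False
    return False


print(has_implicit_concatenation("'{' '0}'.format(1)"))  # True
print(has_implicit_concatenation("'{0}'.format(1)"))     # False
```

A rule could skip the fix whenever this returns True for the receiver of `.format()`, at the cost of tokenizing each candidate a second time.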

colin99d · 2023-01-09T12:14:28Z

> I think it's best just to ignore it (i.e., treat it as if there is no space). It strikes me as exceptionally rare, to the point that it's likely either a mistake or that it's not worth adding more complexity to our own implementation to handle it.

100% agree, especially since the python code will still work.

colin99d · 2023-01-09T12:25:23Z

I just need to fix one bug with this edge case, and then I should be able to take this PR out of draft.

colin99d · 2023-01-10T21:35:08Z

@charliermarsh anything else you want to see from this for merge?

charliermarsh · 2023-01-10T21:50:07Z

@colin99d - Will review this tonight!

charliermarsh · 2023-01-11T00:18:19Z

src/pyupgrade/rules/format_specifiers.rs

+        // FOR REVIEWER: If the new and old are identical, don't create a fix. Pyupgrade
+        // doesnt even want me to report this, so we could just have an enum for errors,
+        // and if a special one is returned here then we dont even report a fix
+        if module_text == final_state.to_string() {


When does this happen?

Ah nevermind.

charliermarsh · 2023-01-11T01:18:34Z

src/pyupgrade/rules/format_literals.rs

+}
+
+/// UP030
+pub(crate) fn format_literals(checker: &mut Checker, summary: &FormatSummary, expr: &Expr) {


@colin99d - It turns out that we already had a utility for extracting the format positions, which uses the RustPython string parser underneath and so is very robust. Sorry that I didn't flag this earlier -- I didn't realize it could even be reused here, but it should make things more efficient too since we can do one string parse and share that "summary" amongst a bunch of checks.
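To illustrate the "parse once, then use the positions" idea outside of Rust, here is a small sketch built on Python's `string.Formatter.parse`. It is not the utility mentioned above and does not mirror its API; `strip_positional_indices` is a hypothetical name, and the bail-out rules are simplified assumptions (indices must appear as 0, 1, 2, ... in order, and nested fields inside a format spec are skipped).

```python
from string import Formatter
from typing import Optional


def strip_positional_indices(template: str) -> Optional[str]:
    """Rewrite '{0} {1}' as '{} {}', or return None when that is not safe."""
    pieces = []
    expected = 0
    for literal, field, spec, conversion in Formatter().parse(template):
        # parse() unescapes doubled braces, so escape them again.
        pieces.append(literal.replace("{", "{{").replace("}", "}}"))
        if field is None:
            continue  # literal text with no replacement field after it
        # Split the leading index from any attribute or item access.
        index, rest = field, ""
        for i, ch in enumerate(field):
            if ch in ".[":
                index, rest = field[:i], field[i:]
                break
        if index != str(expected) or (spec and "{" in spec):
            return None  # named, automatic, out-of-order, or nested field
        expected += 1
        suffix = ("!" + conversion if conversion else "") + (":" + spec if spec else "")
        pieces.append("{" + rest + suffix + "}")
    return "".join(pieces)


print(strip_positional_indices("{0} {1}"))        # {} {}
print(strip_positional_indices("{1} {0}"))        # None
print(strip_positional_indices("{0.name!r:>5}"))  # {.name!r:>5}
```

A caller would only offer a fix when the result is not None and differs from the input.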

charliermarsh · 2023-01-11T01:19:15Z

src/pyupgrade/rules/format_literals.rs

+                contents,
+                expr.location,
+                expr.end_location.unwrap(),
+            ));


I feel like one strategy that could work here would be...

Take the entire text.

Replace the string part via the regex above.

Replace the arguments via LibCST.

Kinda tough, hacky, etc... but would solve the missing cases.

Actually, I guess that wouldn't solve the '{' '0}'.format(1) case.

colin99d · 2023-01-11T01:50:03Z

Thanks for the review!

colin99d added 3 commits January 2, 2023 14:13

Began work on the parser

672c984

Continued progress

c127480

Further along but broken

5e686c4

colin99d added 5 commits January 4, 2023 19:47

Fixed tests

e599734

Added more checks

c68a9da

Added all should pass edge cases

b8c06e7

Fixing small mistakes

6f3c4e5

Added fixes

37d25dd

charliermarsh reviewed Jan 5, 2023

src/pyupgrade/plugins/format_specififiers.rs

colin99d added 3 commits January 6, 2023 17:22

Replaced lazy_static with lazy

287f90f

Fixed merge conflicts

cb836f9

Fixed typos

7645bc5

Fixed incorrect import

dfee10f

Hunting down error with: cargo run resources/test/fixtures/pyupgrade/…

aa714a8

…UP_DELETEME.py --no-cache --select UP

Merged

ec15215

charliermarsh reviewed Jan 9, 2023

colin99d added 7 commits January 8, 2023 20:07

For a multiline print statement just add a check and not a fix

257c828

Added fix for helper functions, and column for SDK type

782fce3

Updated mod

db92e9b

Aded fixes

5adbe05

Added negative cases, fixed one negative edge case

1fd88cd

Handled one more negative edge case

fc6af20

Made progress in testing

7920747

Clippy and fmt

aa0c2cb

colin99d added 3 commits January 9, 2023 07:20

Fixed merge conflicts

29241ef

Added new tests

ef493b2

Fixed linters

0d395a3

Fixed last bug

76fbc56

colin99d marked this pull request as ready for review January 9, 2023 15:59

colin99d and others added 2 commits January 9, 2023 11:36

Fixed clippy and docs

5121008

Merge branch 'main' into formatspecifiers

c237e4e

Merge branch 'main' into formatspecifiers

75ca3ba

charliermarsh reviewed Jan 11, 2023

charliermarsh added 2 commits January 10, 2023 19:35

Use ? in lieu of match, rename to format_literals

4e84dd6

Use format.rs

14a01ec

charliermarsh reviewed Jan 11, 2023

charliermarsh merged commit c016c41 into astral-sh:main Jan 11, 2023

colin99d deleted the formatspecifiers branch January 11, 2023 01:50
