Policy for linter fixes that can drop comments #9790

zanieb · 2024-02-02T16:34:47Z

In some cases, a lint fix can drop comments. Retaining all comments was a major goal in the Ruff formatter and we've learned that it is very hard to accomplish. There is significant foundational work that would need to be invested in the linter to prevent it from ever dropping comments. While we would like to accomplish this eventually, we're a small team with a lot of priorities to balance. In the meantime, we want to formalize a policy for fixes that may drop comments.

A few options:

A fix should not be offered if the range contains a comment
A fix that may drop a comment must always be unsafe
A fix that may drop a comment must be unsafe if a comment is present in the fix range
A fix will only drop comments in ranges that use dangerous syntax e.g. line continuations
A fix that may drop a comment is always safe — we do not make guarantees about comments

We currently apply a mix of these options, but we should be consistent. More options and discussion welcome.

Related issues and reports:

The text was updated successfully, but these errors were encountered:

charliermarsh · 2024-02-02T16:35:54Z

Another thing we do in some places (which IMO is incorrect): if the range contains a comment, we don't offer the fix at all. For example, this is true of SIM401.

charliermarsh · 2024-02-02T19:24:57Z

There also used to be some cases in which we wouldn't flag the violation at all if the range contained comments, but I think we removed these. (I'm not certain.)

charliermarsh · 2024-02-02T19:28:30Z

Just to put a decision out there for discussion, I'd propose...

Dropping comments should be treated as unsafe (and should be documented as such -- it's not unsafe per se based on the current definition in the rustdoc, in my opinion). But we should avoid disabling violations and disabling fixes based on the presence of comments. We will go through and audit rules that rewrites large blocks of code (this tends not to be the norm), e.g., those that modify entire statements, and add appropriate guardrails to them.

charliermarsh · 2024-02-02T21:42:26Z

Since I was asked elsewhere to clarify, the above suggestion is meant to say:

We should not disable a rule due to the presence of comments.

We should not disable a fix due to the presence of comments.

We should mark a fix as unsafe if there's a comment in the range.

We should only prioritize proactively changing the behavior of rules that rewrite large blocks of code (modify entire statements, or convert statements to expressions or vice versa).

MichaReiser · 2024-02-04T11:31:09Z

The ideal situation would be that all rules handle comments correctly and only remove comments if a fix removes the entire code block containing the comment.

Now, what the short-term ideal should be is something I find much harder to implement, and I don't feel I have the understanding to decide on that. What's important to me is that we can come up with a proposal that avoids making the safe and unsafe rule separation meaningless because too many rules are unsafe, so that users who want to use most of our fixes are forced (or motivated) to opt-in to enable unsafe fixes by default.

That's why I need a better understanding of how many rules would be affected by any of the above proposals and the ideal solution to me is the solution where only enabling safe fixes remains the best default for users while being as explicit (and defensive) in which situation ruff removes comments.

UnknownPlatypus · 2024-02-04T14:54:13Z

Chiming in because I recently encountered a somehow related issue.
I got bitten by a combinaison of TID252 with a noqa: F401 on one of the imports.

Basically something like that:

-from . import (
-    foo,
-    bar,
-    MyVeryImportantClass, # noqa: F401
-)
+ from yay import foo, bar

I think what you proposed above about demoting the fix safety to unsafe in case there are comments would have solve my issue here.

jakkdl · 2024-03-06T15:53:34Z

My 2 cents:
unsafe-because-it-may-remove-comments vs unsafe-because-it-may-impact-logic (e.g. sort stability in C413/C414) should perhaps not be treated the same in terms of flags or warnings. Depending on what you settle on, you might want different flags or different levels of unsafe-ness to differentiate them.

If possible it would be great if ruff could detect whether it has lost comments, and surface that to the user. I don't have as strong opinions between

dump lost comments somewhere in the vicinity for the user to deal with
warn the user about comments that have been deleted
say that a fix was attempted, but failed because it would delete a comment, and prompt the user to possibly add a flag that would force the --fix and delete the comment.

I do agree that the current status quo is kinda iffy, but I've also spent an unreasonable amount of time trying to preserve comments in AST & libcst parsers that I can empathize <3

zanieb added the fixes Related to suggested fixes for violations label Feb 2, 2024

zanieb mentioned this issue Feb 2, 2024

PLR5501 fix ate my comment! #9779

Open

charliermarsh mentioned this issue Feb 20, 2024

FURB171: Auto-fix deletes comments #10063

Open

charliermarsh mentioned this issue Mar 6, 2024

C414 --fix is also unsafe for sorted(reversed(iterable)), and other issues encountered when comparing with shed #10245

Open

AlexWaygood mentioned this issue May 3, 2024

[flake8-pyi] Implement PYI059 (generic-not-last-base-class) #11233

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Policy for linter fixes that can drop comments #9790

Policy for linter fixes that can drop comments #9790

zanieb commented Feb 2, 2024 •

edited

charliermarsh commented Feb 2, 2024

charliermarsh commented Feb 2, 2024

charliermarsh commented Feb 2, 2024

charliermarsh commented Feb 2, 2024

MichaReiser commented Feb 4, 2024

UnknownPlatypus commented Feb 4, 2024

jakkdl commented Mar 6, 2024

Policy for linter fixes that can drop comments #9790

Policy for linter fixes that can drop comments #9790

Comments

zanieb commented Feb 2, 2024 • edited

charliermarsh commented Feb 2, 2024

charliermarsh commented Feb 2, 2024

charliermarsh commented Feb 2, 2024

charliermarsh commented Feb 2, 2024

MichaReiser commented Feb 4, 2024

UnknownPlatypus commented Feb 4, 2024

jakkdl commented Mar 6, 2024

zanieb commented Feb 2, 2024 •

edited