[`pycodestyle`] Add `too many blank lines` (`E303`) #4635

hoel-bagard · 2023-05-24T15:16:32Z

Summary

This PR is part of #2402, it adds the E303 error rule with its fix.

Test Plan

It was tested on the added fixture: crates/ruff/resources/test/fixtures/pycodestyle/E303.py.

…bility.

crates/ruff/resources/test/fixtures/pycodestyle/E303.py

crates/ruff/src/rules/pycodestyle/rules/too_many_blank_lines.rs

MichaReiser · 2023-05-24T15:23:17Z

crates/ruff/src/rules/pycodestyle/rules/too_many_blank_lines.rs

+            let range = locator.full_lines_range(TextRange::new(line.start(), last_blank_line));
+            let mut diagnostic = Diagnostic::new(TooManyBlankLines(nb_blank_lines), range);
+            diagnostic.set_fix(Fix::suggested(Edit::range_replacement(
+                "\n\n".to_string(),


We should use stylist (should be on checker) to use the user's preferred newline character.

MichaReiser · 2023-05-24T15:26:23Z

crates/ruff/src/rules/pycodestyle/rules/too_many_blank_lines.rs

+        && !locator
+            .line(TextSize::new(line.start().to_u32() - 1))
+            .trim()
+            .is_empty()


Determining the line is fairly expensive using. You may also be unlucky if the previous line ends with \r\n because you then index right between the \r and \n and there's no good way for the locator to know that it is now between a \r\n.

I'm not quite sure what the best way is to implement this rule. Ideally, it could keep a state between each line, counting the empty lines to this point (CC: @charliermarsh).

I believe pycodestyle tracks the number of empty lines while building up logical lines -- we should probably do the same? That's now straightforward because we emit NonLogicalNewline whenever we see an empty line.

We do, but the physical lines rule operates on the String and I think the logical lines check intentionally skips empty lines... Where would you place this rule? Would this be something new? A rule that directly works on the tokens.

You'd place it at the next logical line -- you'd check the number of preceding empty lines, I think?

Looking at the pycodestyle rule again, my implementation is wrong anyway. The following code should generate an error:

class MyClass(object): def func1(): pass def func2(): pass

-> E303 too many blank lines (2)

I'll try again using logical lines.

@MichaReiser @charliermarsh I've had another go using logical lines. It's a very, very rough draft (that might have broken a few other rules), but could you tell me if you think it's worth continuing on that path ?

The modifications I made to the logical_lines.rs file can be seen here, and the checking function is here. I've also had to remove the explicit skip of empty lines here.

I still haven't solved the issue with \r\n, but maybe I could count the number of characters to remove along with the number of blank lines ?

MichaReiser · 2023-05-24T15:28:03Z

crates/ruff/src/rules/pycodestyle/rules/too_many_blank_lines.rs

+            previous_line_end = locator.full_line_end(previous_line_end);
+            let previous_line = locator.line(previous_line_end);


Line will have to repeat the search for the start and end of the line, which is fairly expensive.

Is there a way that we would only need to find the end (or start) and can then construct the TextRange ourselves (because we know what e.g. the end of the current line is?).

github-actions · 2023-05-24T17:12:06Z

PR Check Results

Benchmark

Linux

group                                      main                                   pr
-----                                      ----                                   --
linter/all-rules/large/dataset.py          1.00     14.9±0.03ms     2.7 MB/sec    1.01     15.0±0.03ms     2.7 MB/sec
linter/all-rules/numpy/ctypeslib.py        1.00      3.6±0.01ms     4.6 MB/sec    1.00      3.7±0.00ms     4.5 MB/sec
linter/all-rules/numpy/globals.py          1.00    375.2±2.92µs     7.9 MB/sec    1.00    376.1±0.93µs     7.8 MB/sec
linter/all-rules/pydantic/types.py         1.00      6.3±0.01ms     4.1 MB/sec    1.01      6.3±0.01ms     4.0 MB/sec
linter/default-rules/large/dataset.py      1.01      7.5±0.02ms     5.4 MB/sec    1.00      7.5±0.01ms     5.4 MB/sec
linter/default-rules/numpy/ctypeslib.py    1.00   1597.4±3.56µs    10.4 MB/sec    1.00   1591.0±1.98µs    10.5 MB/sec
linter/default-rules/numpy/globals.py      1.00    173.8±0.23µs    17.0 MB/sec    1.02    176.8±6.95µs    16.7 MB/sec
linter/default-rules/pydantic/types.py     1.00      3.4±0.01ms     7.5 MB/sec    1.00      3.4±0.01ms     7.5 MB/sec
parser/large/dataset.py                    1.00      5.7±0.01ms     7.2 MB/sec    1.02      5.8±0.01ms     7.0 MB/sec
parser/numpy/ctypeslib.py                  1.00   1126.2±0.69µs    14.8 MB/sec    1.01   1134.6±0.61µs    14.7 MB/sec
parser/numpy/globals.py                    1.00    115.3±0.31µs    25.6 MB/sec    1.02    117.4±0.38µs    25.1 MB/sec
parser/pydantic/types.py                   1.00      2.5±0.01ms    10.4 MB/sec    1.01      2.5±0.01ms    10.3 MB/sec

Windows

group                                      main                                   pr
-----                                      ----                                   --
linter/all-rules/large/dataset.py          1.00     22.3±0.71ms  1867.3 KB/sec    1.02     22.7±0.83ms  1832.9 KB/sec
linter/all-rules/numpy/ctypeslib.py        1.00      5.6±0.22ms     3.0 MB/sec    1.04      5.8±0.23ms     2.9 MB/sec
linter/all-rules/numpy/globals.py          1.00   668.5±31.94µs     4.4 MB/sec    1.04   693.1±40.81µs     4.3 MB/sec
linter/all-rules/pydantic/types.py         1.00      9.6±0.50ms     2.6 MB/sec    1.01      9.7±0.34ms     2.6 MB/sec
linter/default-rules/large/dataset.py      1.01     11.3±0.46ms     3.6 MB/sec    1.00     11.2±0.33ms     3.6 MB/sec
linter/default-rules/numpy/ctypeslib.py    1.00      2.4±0.09ms     7.0 MB/sec    1.00      2.4±0.11ms     7.0 MB/sec
linter/default-rules/numpy/globals.py      1.02   279.5±14.61µs    10.6 MB/sec    1.00   274.3±14.48µs    10.8 MB/sec
linter/default-rules/pydantic/types.py     1.01      5.1±0.25ms     5.0 MB/sec    1.00      5.0±0.20ms     5.1 MB/sec
parser/large/dataset.py                    1.00      9.0±0.35ms     4.5 MB/sec    1.17     10.5±0.38ms     3.9 MB/sec
parser/numpy/ctypeslib.py                  1.00  1717.3±72.78µs     9.7 MB/sec    1.14  1952.4±63.56µs     8.5 MB/sec
parser/numpy/globals.py                    1.00   174.5±11.14µs    16.9 MB/sec    1.12    194.6±8.65µs    15.2 MB/sec
parser/pydantic/types.py                   1.00      3.8±0.17ms     6.7 MB/sec    1.15      4.4±0.13ms     5.9 MB/sec

calumy · 2023-05-25T12:38:30Z

crates/ruff/src/rules/pycodestyle/rules/too_many_blank_lines.rs

+/// ## Example
+/// ```python
+/// def func1():
+///     pass
+///
+///
+///
+/// def func2():
+///     pass
+/// ```


Because this issue is fixed by black, the rule name should be added to the KNOWN_FORMATTING_VIOLATIONS list to stop CI flagging that it should be fixed.

Is there a list somewhere with black's rules names ?

In this case, you would add too_many_blank_lines to the list as KNOWN_FORMATTING_VIOLATIONS relates to Ruff's rules rather than black.

I see, thanks!

hoel-bagard · 2023-05-26T03:26:27Z

I'm closing this PR since I'm redoing it using logical lines instead of physical lines.

hoel-bagard and others added 6 commits May 24, 2023 23:05

Add E303, need refactoring.

0471442

Made count go from top to bottom (instead of bottom to top) for reada…

d0753cd

…bility.

Added test fixture.

3a19cbb

Added rule to registry.

92bfeca

Added snapshot.

288d82a

Merge branch 'charliermarsh:main' into E303_too_many_blank_lines

8b5c3a6

charliermarsh reviewed May 24, 2023

View reviewed changes

crates/ruff/resources/test/fixtures/pycodestyle/E303.py Outdated Show resolved Hide resolved

MichaReiser reviewed May 24, 2023

View reviewed changes

hoel-bagard added 3 commits May 25, 2023 00:36

Pass a boolean indicating whether it is the first line or not.

e1f3e1d

Use pycodestyles' test suite

ff64a14

Updated snapshot.

d0b4b45

calumy reviewed May 25, 2023

View reviewed changes

hoel-bagard closed this May 26, 2023

hoel-bagard mentioned this pull request May 28, 2023

[pycodestyle] Add blank line(s) rules (E301, E302, E303, E304, E305, E306) #4694

Closed

hoel-bagard mentioned this pull request Nov 16, 2023

[pycodestyle] Add blank line(s) rules (E301, E302, E303, E304, E305, E306) #8720

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[`pycodestyle`] Add `too many blank lines` (`E303`) #4635

[`pycodestyle`] Add `too many blank lines` (`E303`) #4635

hoel-bagard commented May 24, 2023

MichaReiser May 24, 2023

MichaReiser May 24, 2023

charliermarsh May 24, 2023

MichaReiser May 24, 2023

charliermarsh May 24, 2023

hoel-bagard May 25, 2023 •

edited

Loading

hoel-bagard May 25, 2023

MichaReiser May 24, 2023

github-actions bot commented May 24, 2023

calumy May 25, 2023

hoel-bagard May 25, 2023

calumy May 25, 2023

hoel-bagard May 25, 2023

hoel-bagard commented May 26, 2023

		previous_line_end = locator.full_line_end(previous_line_end);
		let previous_line = locator.line(previous_line_end);

[pycodestyle] Add too many blank lines (E303) #4635

[pycodestyle] Add too many blank lines (E303) #4635

Conversation

hoel-bagard commented May 24, 2023

Summary

Test Plan

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

hoel-bagard May 25, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

github-actions bot commented May 24, 2023

PR Check Results

Benchmark

Linux

Windows

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

hoel-bagard commented May 26, 2023

[`pycodestyle`] Add `too many blank lines` (`E303`) #4635

[`pycodestyle`] Add `too many blank lines` (`E303`) #4635

hoel-bagard May 25, 2023 •

edited

Loading