lint: Refactor lint checks to reuse exclusion logic; Support running individual lint checks in lint runner #29744

davidgumberg · 2024-03-26T22:20:30Z

Motivated by a comment in #29479, this draft PR refactors the lint runner into multiple modules, and refactors a few of the python linters to reuse common file excluding logic.

The include guards lint now prints all missing include guards before exiting, and all of the rewritten linters now exclude all subtrees, including crypto/caes.

The lint runner now supports running individual linters by passing --lint=LINT_CHECK_TO_RUN to the lint runner. If using cargo run arguments can be passed e.g.:

cd test/lint/test_runner && cargo run -- --lint=doc --lint=includes

I can see reasons why the benefits of this PR might be outweighed by the downsides so I'm just publishing this as a draft to explore if further rewriting/refactoring of the lint suite in rust is desirable, and if so, what the best approach is.

DrahtBot · 2024-03-26T22:20:32Z

The following sections might be updated with supplementary metadata relevant to reviewers and maintainers.

Code Coverage

For detailed information about the code coverage, see the test coverage report.

Reviews

See the guideline for information on the review process.

Type	Reviewers
Concept NACK	fjahr
Concept ACK	dergoegge

If your review is incorrectly listed, please react with 👎 to this comment and the bot will ignore it on the next update.

Conflicts

Reviewers, this pull request conflicts with the following ones:

#29494 (build: Assume HAVE_CONFIG_H, Add IWYU pragma keep to bitcoin-config.h includes by maflcko)
#28981 (Replace Boost.Process with cpp-subprocess by hebasto)
#22417 (util/system: Close non-std fds when execing slave processes by luke-jr)

If you consider this pull request important, please also help to review the conflicting pull requests. Ideally, start with the one that should be merged first.

fjahr · 2024-03-26T22:50:00Z

I think this needs a conceptual discussion if we want to move all Python code to Rust eventually.

dergoegge · 2024-03-27T10:41:13Z

Concept ACK 🦀

maflcko · 2024-03-27T11:21:42Z

I think this needs a conceptual discussion if we want to move all Python code to Rust eventually.

I don't think moving all Python code to Rust makes sense. Simply due to the reason that reviewing such a large change is close to impossible.

$ cat $( git ls-files '*.py' ) | wc -l 
78641

See also my reply in another thread: #29479 (comment)

maflcko · 2024-03-27T11:25:44Z

The only expected change in behavior is that the include guards lint now prints all missing include guards before exiting.

I think you are missing that crypto/ctaes is a subtree, which previously was not ignored by some checks, and now is. It may be good to mention this.

maflcko · 2024-03-27T12:04:56Z

Also, looking at this again, test/lint/lint-locale-dependence.py and test/lint/lint-python-utf8-encoding.py were forgotten?

I think it could make sense to either add them to this pull, or correct the python list, and make them to use the python list.

fjahr · 2024-03-27T12:09:09Z

I don't think moving all Python code to Rust makes sense. Simply due to the reason that reviewing such a large change is close to impossible.

The tests are already split up into separate files and can be ported one at a time. We could pull in rust_bitcoin to get access to all the low-level stuff they have already written. It is a big project but certainly doable if Rust is considered a big enough upgrade. If it's just a minor improvement I don't think it's worth alienating the developers that don't know Rust from contributing to the linters.

maflcko · 2024-03-27T12:20:46Z

The tests are already split up into separate files and can be ported one at a time

Sure, but I think the functional test framework itself is already 10k lines of code. Having to support two test frameworks that are supposed to behave similar or identical may already be too difficult.

fjahr · 2024-03-27T13:38:55Z

Previously the linters were separated into their own files and could be called separately as well. This is now moving to one big monolytic program, which is not great. The lint_all function also doesn't actually run all linters anymore with this change, it only runs the python ones, not the rewritten ones, so it should probably be renamed?

Sure, but I think the functional test framework itself is already 10k lines of code. Having to support two test frameworks that are supposed to behave similar or identical may already be too difficult.

If such a rewrite is not worth it on a big scale we should also not do it on a small scale. The value and effort scale up proportionally with the size of the change. And if we are concerned about resources I think we also should do this here either because a small distraction is a distraction as well.

maflcko · 2024-03-27T18:50:10Z

Previously the linters were separated into their own files and could be called separately as well.

Good point. This should be trivial to fix. For example by having an environment variable to indicate which check to run. For example LINT=subtree cargo run to run lint_subtree, and LINT=help to print all LINT tests.

The lint_all function also doesn't actually run all linters anymore with this change, it only runs the python ones

I don't think this has changed. The linters written in bash were never run as part of "lint all", for example git-subtree-check.sh or test/lint/commit-script-check.sh. Also, test/lint/check-doc.py (written in python) was never run as part of "lint all".

Generally, I think more important than the choice of language is to actually fix the issue that was intended to be solved. That is, #29744 (comment) and #29744 (comment) . Whether the fix is done in Python or Rust should be secondary, as long as the fix is done.

Refactor lint runner into separate modules and use get_pathspecs_exclude* helpers everywhere.

Add support for passing `--lint=LINT_TO_RUN` to the rust lint runner.

davidgumberg · 2024-04-03T00:35:14Z

I've substantially revised this PR to address @fjahr's feedback that the rust lint runner monofile was getting too big, and that it doesn't support running individual tests and @maflcko's suggestion to include test/lint/lint-locale-dependence.py and test/lint/lint-python-utf8-encoding.py .

I am just OK at regex, and had to do some tricky things to avoid importing the rust regex crate, so I ask any reviewers to give special care to all of the regex segments, especially the pattern in fn get_locale_dependence_regexp(), I really appreciate your time and feedback!

I feel that while whether the added review and complexity burden of the changes here and, broadly, of having another language for everyone to be knowledgeable of outweighs the benefits remains an open question, (I am concept +0) it is possible for further rewriting of the lint suite to be worthwhile without it being the case that moving all Python code to Rust should be a good idea.

maflcko · 2024-04-05T13:09:08Z

test/lint/test_runner/src/linters/doc.rs

+    } else {
+        Err("".to_string())
+    }
+}


In 1ffdf3f: Why? What is the point to put a single function consisting of a single if in a new file and module? I fail to see how more boilerplate than the actual logic is beneficial.

If your point is that large files shouldn't exist, then I tend to disagree. For example, doc/developer-notes.md is large and exists. Same for ./src/rpc/blockchain.cpp.

My preference would be to not move code, unless there is a reason.

DrahtBot · 2024-04-10T10:55:35Z

🐙 This pull request conflicts with the target branch and needs rebase.

maflcko · 2024-04-15T11:37:03Z

Are you still working on this?

davidgumberg · 2024-04-15T12:24:15Z

Are you still working on this?

Yes, I was just waiting to see if there was any other feedback before putting all of the linter code back into a single file.

I think a lot of the reason for the individual lint checks being placed in separate files previously was so that they could be run individually, but I feel that a single file that contains both the lint checks and the lint runner is a bit clunky.

I would rather leave splitting up main.rs to the tastes of a future PR, unless someone has a strong opinion about how the files should be laid out.

fjahr · 2024-04-15T13:52:31Z

Concept NACK

As I have said before, I think there needs to be a discussion about using more Rust just for the sake of using more Rust in the project. The only motivation given so far is that @maflcko said that it could be done. There is no benefit to doing this otherwise apparently and the biggest effect I see is that there will be fewer people able to contribute to that part of the code base. And this change alone will also take up a lot of review effort so it appears to be a clear negative, at least in the short to mid-term.

maflcko · 2024-04-15T14:08:56Z

As I have said before, I think there needs to be a discussion about using more Rust just for the sake of using more Rust in the project. The only motivation given so far is that @maflcko said that it could be done.

Not sure where you get that from, but I never said that using more Rust for the sake of using more Rust can be done. I said something else:

I *don't* think moving all Python code to Rust makes sense. https://www.github.com/bitcoin/bitcoin/pull/29744#issuecomment-2022526794
Generally, I think more important than the choice of language is to actually fix the issue that was intended to be solved. That is, _ and _ . Whether the fix is done in Python or Rust should be secondary, as long as the fix is done. https://www.github.com/bitcoin/bitcoin/pull/29744#issuecomment-2023709475
Rust is type-safe, so many issues that are found after code is written, reviewed, and merged just can not happen and will be caught, ideally before a pull is reviewed and merged. For example ... https://www.github.com/bitcoin/bitcoin/pull/29479#issuecomment-2021251949

fjahr · 2024-04-15T14:15:06Z

Not sure where you get that from

"an alternative might be to rewrite the 3 lint checks to rust" #29479 (review) That is what @davidgumberg has given as the motivation in his description. It does not matter what you have said in other places because I have not been talking about that. I only talked about your comment that the author took as the motivation for this PR.

maflcko · 2024-04-15T14:45:30Z

That comment says so that they can use the already existing function as the motivation. (This is the issue that was attempted to be solved and why the other pull request was created in the first place). Re-reading my comment, I don't get the impression that it said "using more Rust just for the sake of using more Rust in the project can be done".

fjahr · 2024-04-15T14:49:24Z

That comment says so that they can use the already existing function as the motivation. (This is the issue that was attempted to be solved and why the other pull request was created in the first place).

Right, and that original PR was merged so that part does not apply here anymore, still this PR is still open.

Re-reading my comment, I don't get the impression that it said "using more Rust just for the sake of using more Rust in the project can be done".

Well, my impression is that the author got that impression and opened this PR because of it.

maflcko · 2024-04-15T14:58:56Z

Right, and that original PR was merged so that part does not apply here anymore, still this PR is still open.

My understanding is that the original PR didn't actually fix the issue, so the problem remains. For example, crypto/ctaes is a subtree, but test/lint/lint_ignore_dirs.py does not list it. Also, test/lint/lint-locale-dependence.py and test/lint/lint-python-utf8-encoding.py don't use test/lint/lint_ignore_dirs.py.

Let me know if I am missing something. If the issue is indeed fixed, this pull request can clearly be closed.

In any case, it seems better to close this pull request and start a fresh one, if needed, which clearly explains the issue/problem in its own words, and how it is fixed. I don't think the discussion in this pull request, and the fact that the patch changed after opening, makes it easy to follow.

fjahr · 2024-04-15T15:18:16Z

For example, crypto/ctaes is a subtree, but test/lint/lint_ignore_dirs.py does not list it. Also, test/lint/lint-locale-dependence.py and test/lint/lint-python-utf8-encoding.py don't use test/lint/lint_ignore_dirs.py.

Right, these are behavior changes that I did not see as part of the motivation of the original PR, as the title says it was a refactor. So I don't think this is something that was missed, it was just not the goal. It should be made more clear that this is the main motivation of this PR, to improve exclusion of subtrees. It should not be called a refactor in the title as it is now.

maflcko · 2024-04-22T12:11:43Z

Closing for now per previous discussion. Feel free to create a new pull request(s) with the behavior changes and an accurate description in your own words. Please try to avoid the use of the word "refactor" for bugfixes or behavior changes.

davidgumberg · 2024-04-26T00:19:13Z

@maflcko @fjahr Thank you for the feedback, sorry for the confusion on this PR. I've opened #29965

DrahtBot added the Tests label Mar 26, 2024

davidgumberg mentioned this pull request Mar 26, 2024

test: Refactor subtree exclusion in lint tests #29479

Merged

bitcoin deleted a comment from unicornsavior42 Mar 26, 2024

DrahtBot added the Needs rebase label Mar 27, 2024

DrahtBot mentioned this pull request Mar 27, 2024

Replace Boost.Process with cpp-subprocess #28981

Merged

DrahtBot mentioned this pull request Mar 27, 2024

util/system: Close non-std fds when execing slave processes #22417

Open

davidgumberg force-pushed the rustlin branch from a1b5e66 to 3b8de77 Compare April 2, 2024 23:52

davidgumberg added 10 commits April 2, 2024 20:01

Refactor lint runner into separate modules

7b5549d

Refactor lint runner into separate modules and use get_pathspecs_exclude* helpers everywhere.

Refactor rust linters into separate modules

1ffdf3f

Support individual linters in rust lint runner

871133f

Add support for passing `--lint=LINT_TO_RUN` to the rust lint runner.

Docs: Running individual lint checks

840436f

Rewrite include guard linter in rust

22898ad

Rewrite includes linter in rust

3cf58e7

Rewrite spelling linter in rust

8e49b1e

Rewrite locale dependence linter in Rust

9b02b26

Rewrite python encoding lints in rust

733b881

Codespell false positive crate->create

c239e90

davidgumberg force-pushed the rustlin branch from 3b8de77 to c239e90 Compare April 3, 2024 00:01

DrahtBot added the CI failed label Apr 3, 2024

davidgumberg changed the title ~~lint: Rewrite spelling, includes, and include guards linters in Rust~~ lint: Refactor lint checks to reuse exclusion logic; Support running individual lint checks in lint runner Apr 3, 2024

DrahtBot changed the title ~~lint: Refactor lint checks to reuse exclusion logic; Support running individual lint checks in lint runner~~ lint: Refactor lint checks to reuse exclusion logic; Support running individual lint checks in lint runner Apr 3, 2024

DrahtBot removed Needs rebase CI failed labels Apr 3, 2024

DrahtBot mentioned this pull request Apr 5, 2024

build: Assume HAVE_CONFIG_H, Add IWYU pragma keep to bitcoin-config.h includes #29494

Merged

maflcko reviewed Apr 5, 2024

View reviewed changes

DrahtBot added the Needs rebase label Apr 10, 2024

maflcko closed this Apr 22, 2024

davidgumberg mentioned this pull request Apr 25, 2024

Lint: support running individual rust linters and improve subtree exclusion #29965

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lint: Refactor lint checks to reuse exclusion logic; Support running individual lint checks in lint runner #29744

lint: Refactor lint checks to reuse exclusion logic; Support running individual lint checks in lint runner #29744

davidgumberg commented Mar 26, 2024 •

edited

DrahtBot commented Mar 26, 2024 •

edited

fjahr commented Mar 26, 2024

dergoegge commented Mar 27, 2024 •

edited

maflcko commented Mar 27, 2024

maflcko commented Mar 27, 2024

maflcko commented Mar 27, 2024

fjahr commented Mar 27, 2024

maflcko commented Mar 27, 2024

fjahr commented Mar 27, 2024 •

edited

maflcko commented Mar 27, 2024

davidgumberg commented Apr 3, 2024 •

edited

maflcko Apr 5, 2024 •

edited

DrahtBot commented Apr 10, 2024

maflcko commented Apr 15, 2024

davidgumberg commented Apr 15, 2024

fjahr commented Apr 15, 2024

maflcko commented Apr 15, 2024

fjahr commented Apr 15, 2024

maflcko commented Apr 15, 2024

fjahr commented Apr 15, 2024

maflcko commented Apr 15, 2024

fjahr commented Apr 15, 2024

maflcko commented Apr 22, 2024

davidgumberg commented Apr 26, 2024

lint: Refactor lint checks to reuse exclusion logic; Support running individual lint checks in lint runner #29744

lint: Refactor lint checks to reuse exclusion logic; Support running individual lint checks in lint runner #29744

Conversation

davidgumberg commented Mar 26, 2024 • edited

DrahtBot commented Mar 26, 2024 • edited

Code Coverage

Reviews

Conflicts

fjahr commented Mar 26, 2024

dergoegge commented Mar 27, 2024 • edited

maflcko commented Mar 27, 2024

maflcko commented Mar 27, 2024

maflcko commented Mar 27, 2024

fjahr commented Mar 27, 2024

maflcko commented Mar 27, 2024

fjahr commented Mar 27, 2024 • edited

maflcko commented Mar 27, 2024

davidgumberg commented Apr 3, 2024 • edited

maflcko Apr 5, 2024 • edited

Choose a reason for hiding this comment

DrahtBot commented Apr 10, 2024

maflcko commented Apr 15, 2024

davidgumberg commented Apr 15, 2024

fjahr commented Apr 15, 2024

maflcko commented Apr 15, 2024

fjahr commented Apr 15, 2024

maflcko commented Apr 15, 2024

fjahr commented Apr 15, 2024

maflcko commented Apr 15, 2024

fjahr commented Apr 15, 2024

maflcko commented Apr 22, 2024

davidgumberg commented Apr 26, 2024

davidgumberg commented Mar 26, 2024 •

edited

DrahtBot commented Mar 26, 2024 •

edited

dergoegge commented Mar 27, 2024 •

edited

fjahr commented Mar 27, 2024 •

edited

davidgumberg commented Apr 3, 2024 •

edited

maflcko Apr 5, 2024 •

edited