Re-add dead code detection #21096

flack · 2021-02-06T19:09:53Z

This re-adds unreachable code detection for Python based on vulture.

Effectively, this reverts f4beb49. The difference from the previous version is that this runs with the --min-confidence 100 setting. From https://pypi.org/project/vulture/:

Use --min-confidence 100 to only report code that is guaranteed to be unused within the analyzed files.

So this should avoid the previous issues, where static analysis produced false positives due to the dynamic nature of Python code, by only reporting things that are unambiguous (such as code after a return statement). As such, there is no suppressions list.
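
To make "unambiguous" concrete, here is a minimal sketch of the kind of finding meant here: a statement that directly follows a `return` in the same block. It uses only the standard `ast` module and is not vulture's implementation; the function name `find_code_after_return` and the sample source are made up for illustration.

```python
import ast

# Hypothetical sample: the second statement in the body can never run.
SAMPLE = '''
def get_random_number():
    return 4
    return compute_random()
'''


def find_code_after_return(source):
    """Return line numbers of statements that directly follow a `return`
    in the same statement list. Simplified illustration, not vulture's logic."""
    unreachable = []
    for node in ast.walk(ast.parse(source)):
        # Statement lists live in these fields of compound statements.
        for field in ("body", "orelse", "finalbody"):
            stmts = getattr(node, field, None)
            if not isinstance(stmts, list):
                continue  # e.g. a lambda or conditional expression body
            for prev, cur in zip(stmts, stmts[1:]):
                if isinstance(prev, ast.Return):
                    unreachable.append(cur.lineno)
    return sorted(unreachable)


print(find_code_after_return(SAMPLE))  # [4]
```

A finding like this does not depend on how the code is called at runtime, which is why it can be reported without a suppressions list.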

My motivation was mainly #21081, which would have been caught by this (as can be seen by the CI run failing). This is still marked as draft because #21081 is needed to get the linter to pass. Also, there is a second problem that this found (see https://github.com/bitcoin/bitcoin/pull/19509/files#r571454691). From what I can tell, this is a spurious type comment that could just be removed (or if that line has no side effects it could also be deleted altogether?). I could add a commit here to fix it, but I wanted to see if there is interest in having this linter again in the first place.

flack · 2021-02-06T19:16:36Z

meh, turns out I clicked the wrong button, and now I can't mark it as Draft anymore... modified the title instead

flack · 2021-02-08T18:53:22Z

#21081 has been merged and the other issue I mentioned has been addressed in #21107, so this should now be ready for review. I also bumped the vulture dependency to 2.3, since it turns out Python 3.5 support is no longer needed.

maflcko · 2021-02-08T19:05:03Z

Please squash your commits according to https://github.com/bitcoin/bitcoin/blob/master/CONTRIBUTING.md#squashing-commits

flack · 2021-02-08T20:16:51Z

@MarcoFalke done

fanquake · 2021-02-09T03:34:23Z

Seems like at --min-confidence=100 vulture will just report unreachable code.
You have to drop it to 90 for it to report unused imports:

diff --git a/test/functional/test_framework/key.py b/test/functional/test_framework/key.py
index e0cbab45c..d0f3472e9 100644
--- a/test/functional/test_framework/key.py
+++ b/test/functional/test_framework/key.py
@@ -14,6 +14,10 @@ import unittest
 
 from .util import modinv
 
+import nonsense
+
 def TaggedHash(tag, data):
     ss = hashlib.sha256(tag.encode('utf-8')).digest()
     ss += ss

bitcoin/test/functional/test_framework/key.py:17: unused import 'nonsense' (90% confidence)

or to 60 before it will report unused functions, e.g. xor_bytes in #21100:

bitcoin/test/functional/test_framework/key.py:23: unused function 'xor_bytes' (60% confidence)

Looking at the vulture source this makes sense, because the default confidence for basically all checks is 60.
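
The thresholds above can be illustrated with a small filter over vulture-style report lines. This sketch assumes the `path:line: message (NN% confidence)` format of the output quoted in this thread; `filter_by_confidence` is a hypothetical helper and not part of vulture.

```python
import re

# Report lines in the format quoted in this thread.
REPORT = [
    "/tmp/cirrus-ci-build/test/functional/feature_taproot.py:521: unreachable code after 'return' (100% confidence)",
    "bitcoin/test/functional/test_framework/key.py:17: unused import 'nonsense' (90% confidence)",
    "bitcoin/test/functional/test_framework/key.py:23: unused function 'xor_bytes' (60% confidence)",
]

CONFIDENCE_RE = re.compile(r"\((\d+)% confidence\)$")


def filter_by_confidence(lines, min_confidence):
    """Keep only report lines whose stated confidence is >= min_confidence."""
    kept = []
    for line in lines:
        match = CONFIDENCE_RE.search(line)
        if match and int(match.group(1)) >= min_confidence:
            kept.append(line)
    return kept


for threshold in (100, 90, 60):
    print(threshold, len(filter_by_confidence(REPORT, threshold)))
```

With these three sample lines, a threshold of 100 keeps only the unreachable-code finding, 90 adds the unused import, and 60 adds the unused function.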

c9095b7 test: remove unnecessary assignment in bdb (Bruno Garcia)

Pull request description:

  This PR removes the unnecessary assignment to page_info['entries'] on line 54 since there is another assignment for it in line 59. I think a lint (bitcoin#21096) would detect cases like this one.

ACKs for top commit:
  achow101: ACK c9095b7
  theStack: Code Review ACK c9095b7

Tree-SHA512: 23377077c015b04361fd416b41bf6806ad0bdd4d264be6760f0fd3bc88d694d2cd52cae250519925c5d3b3c70715772714c3863f8fa181a2eb4883204ccdbf9d

laanwj · 2021-02-10T08:38:43Z

~~NACK on this. I still stand by discussion in #16961. Even having to fine-tune the threshold value just seems a recipe for endless frustration.~~

flack · 2021-02-10T10:07:39Z

@laanwj I'm not going to spend too much time arguing, but the rationale for #16961 was to avoid false positives. --min-confidence=100 accomplishes that, since it is guaranteed to only find dead code (inside an if (false), after a return statement, or similar). So there isn't any need to fine-tune anything. This is just a little sanity check that completes in less than one second even on my five-year-old laptop, and it would have caught an instance of xkcd 221 in the taproot tests, which is probably one of the better-reviewed PRs. What's not to like?

test/lint/lint-python-dead-code.sh

practicalswift · 2021-02-10T21:21:29Z

Concept ACK

If simply specifying --min-confidence 100 means that the false positives we've seen in the past are avoided then this should be an obvious win, no? :)

@laanwj, is your NACK made assuming that --min-confidence 100 is not sufficient to solve the false positive issues we've seen in the past? Not questioning: just trying to understand the reasoning :)

laanwj · 2021-02-11T15:45:06Z

Okay, retracted the NACK. If everyone else wants this, it's okay with me. I just would really prefer that it doesn't come up as an issue again.

practicalswift · 2021-02-11T16:18:46Z

test/lint/lint-python-dead-code.sh

+# Any value below 100 introduces the risk of false positives, which would create an unacceptable maintenance burden.
+if ! vulture \
+    --min-confidence 100 \
+    $(git rev-parse --show-toplevel); then


Should this be $(vulture --min-confidence 100 -- $(git ls-files -- "*.py")) instead to make sure only files in the repo are checked?

flack · 2021-02-12

Could be changed if people think that's better. The rev-parse version is what the previous incarnation of the linter had. I was thinking maybe you want to run the linter against a new file before you git add it?

AFAICT, vulture automatically filters by file extension (https://github.com/jendrikseipp/vulture/blob/master/vulture/config.py#L105), so *.py should be implicit.

laanwj · 2021-02-12

I don't think this is a serious issue, but ostensibly a developer could, unwisely, keep their own .py scripts inside the repo directory, which would make no sense to check here, so I think using ls-files to only check the committed ones makes sense.

flack · 2021-02-13

Alright, changed to ls-files as requested.
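
A rough sketch of the difference being discussed, written in Python rather than the shell of the actual lint script: `git ls-files -- "*.py"` lists only tracked Python files, so untracked scratch scripts in the working tree are never passed to vulture. The helper names below are hypothetical; only the `git ls-files` and `vulture --min-confidence` invocations come from this thread. The sketch also assumes plain file names that git does not need to quote.

```python
import subprocess


def tracked_python_files(repo_dir="."):
    """List Python files tracked by git (untracked files are not included)."""
    result = subprocess.run(
        ["git", "ls-files", "--", "*.py"],
        cwd=repo_dir, check=True, capture_output=True, text=True,
    )
    return result.stdout.splitlines()


def build_vulture_command(files, min_confidence=100):
    """Build the vulture argument list for an explicit set of files."""
    return ["vulture", "--min-confidence", str(min_confidence), "--", *files]


# Example with a fixed file list (no git call needed):
print(build_vulture_command(["test/functional/test_framework/key.py"]))
```

In a real checkout, the two helpers would be combined as `build_vulture_command(tracked_python_files())` and the result passed to `subprocess.run`.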

practicalswift · 2021-02-13T15:20:05Z

ACK 3f8776a

maflcko · 2021-02-15T14:13:40Z

Tested ACK by reintroducing the bug, didn't review:

/tmp/cirrus-ci-build/test/functional/feature_taproot.py:521: unreachable code after 'return' (100% confidence)
^---- failure generated from test/lint/lint-python-dead-code.sh

3f8776a Re-add dead code detection (flack)

Pull request description:

  This re-adds unreachable code detection for Python based on `vulture`.

  Effectively, this reverts f4beb49. The difference to the previous version is that this runs with the `--min-confidence 100` setting. From https://pypi.org/project/vulture/:

  > Use `--min-confidence 100` to only report code that is guaranteed to be unused within the analyzed files.

  So this should avoid the previous issues where static analysis had wrong positives due to the dynamic nature of Python code by only reporting things that are unambiguous (such as code after a `return` statement). As such, there is not suppressions list.

  My motivation was mainly bitcoin#21081 which would have been caught by this (as can be seen by the CI run failing). This is still marked as draft because bitcoin#21081 is needed to get the linter to pass. Also, there is a second problem that this found (see https://github.com/bitcoin/bitcoin/pull/19509/files#r571454691). From what I can tell, this is a spurious type comment that could just be removed (or if that line has no side effects it could also be deleted altogether?). I could add a commit here to fix it, but I wanted to see if there is interest in having this linter again in the first place

ACKs for top commit:
  practicalswift: ACK 3f8776a

Tree-SHA512: 52314ad4f627d969de1eb15375ca677ed86a2e816fe773756a1ce22421214ba407b5a09a4bf701a3aab1a10c7b336f548e4cef3327edf154acba55e987db21f6

flack changed the title ~~Re-add dead code detection~~ [draft] Re-add dead code detection Feb 6, 2021

maflcko marked this pull request as draft February 6, 2021 20:15

DrahtBot added the Tests label Feb 6, 2021

fanquake changed the title ~~[draft] Re-add dead code detection~~ Re-add dead code detection Feb 7, 2021

flack marked this pull request as ready for review February 8, 2021 18:53

brunoerg mentioned this pull request Feb 9, 2021

test: remove unnecessary assignment in bdb #21124

Merged

maflcko reviewed Feb 10, 2021

practicalswift reviewed Feb 11, 2021

Re-add dead code detection

3f8776a

maflcko merged commit d19639d into bitcoin:master Feb 15, 2021

This was referenced Mar 1, 2022

test: Add dead code detection gridcoin-community/Gridcoin-Research#2446

Closed

test: Add dead code detection gridcoin-community/Gridcoin-Research#2449

Merged

bitcoin locked as resolved and limited conversation to collaborators Aug 16, 2022
