jwt.decode sonar codemod by clavedeluna · Pull Request #326 · pixee/codemodder-python

clavedeluna · 2024-03-02T15:09:31Z

Overview

Add codemod support for the sonar rule for jwt.decode

Description

The challenge with this codemod is matching sonar results column to decode call node column. Sonar returns column start/end for ....verify=False or ...."verify_signature": True while the semgrep node we care about is jwt.decode which contains these verify keywords. To get the codemod to run, I had to implement a codemod specific match_location that I called fuzzy.

Additional Details

some minor cleanup. /documentation

Closes #298

clavedeluna · 2024-03-02T15:20:32Z


 class TestSonarExceptionWithoutRaise(BaseIntegrationTest):
-    codemod = ExceptionWithoutRaise
+    codemod = SonarExceptionWithoutRaise


I realized this int test was importing the wrong codemod

clavedeluna · 2024-03-02T15:21:53Z

        pos_to_match = self.node_position(node)
        if self.results is None:
+            # Some codemods must run without results existing.
            return True


I documented this because it was strange to return True here (do we filter by result or not filter by result?). Then I realized the logic here is that this method is used for all codemods - those with and those without results. We still want to analyze codemods even if there are no results. Maybe later on we can have a filter_by_result that's more relevant. The one I implemented for jwt sonar returns False here, since there should always be results if we want to run the codemod.

Yes I encountered this too. We're using None as a sentinel for codemods that do not have detectors, and we do not want to short-circuit those cases. If you don't mind, could you update the comment to say something along those lines?

andrecsilva · 2024-03-04T12:24:55Z

+            same_line(pos, location) and fuzzy_column_match(pos, location)
+            for location in result.locations
+        )


Did you encounter any examples where fuzzy_column_match is needed? From my observations most nodes reported by sonar would match libcst ones with one exception: Tuples. If you look at the match_location from SonarResult you can see the exception treated there. We should discuss how to treat location match at some point.

You're going to have to give me an example because adding fuzzy_column_match is the entire point of this PR. As I stated in the PR description, sonar returns index for the keyword=value, while we need jwt.decode(...keyword=value). Unit tests have exact sonar results, too. So inspect those as well.

Or perhaps Im' not understanding your comment?

In the simplest terms, does anything break if you remove fuzzy_column_match and replace it by exact match?
If so, does it break consistently? (e.g. always with the same type of node)

Yes, there are no location matches without fuzzy matching.

drdavella

Looks great! Just a few smallish comments.

drdavella · 2024-03-04T14:15:37Z

+        results returned have a start/end column for the verify keyword
+        within the `decode` call, not for the entire call like semgrep returns.
+        """
+        if self.results is None:


I believe this will actually never occur in the Sonar case and you can remove this.

yep very good point. I was trying to keep this method as similar to its base one, but you're right it's nonsensical in this case.

drdavella · 2024-03-04T14:17:13Z

        pos_to_match = self.node_position(node)
        if self.results is None:
+            # Some codemods must run without results existing.
            return True


Yes I encountered this too. We're using None as a sentinel for codemods that do not have detectors, and we do not want to short-circuit those cases. If you don't mind, could you update the comment to say something along those lines?

drdavella · 2024-03-04T14:20:36Z

+        """
+        if self.results is None:
+            return False
+        match node:


This solution makes sense to me and it works for now. In the longer term, I think we should filter the Sonar/SAST results before they ever reach the transformer. This means we would add some logic to each detector that requires it. This would mean implementing some kind of visitor that validates each of the detected locations before passing the results to the transformer. There's a bit of a performance impact in this case since it effectively means another pass on the file, but it only applies to files where there are already results.

I'm saying this not because I think we need this change right now but because I've encountered similar issues with remediating another SAST tool and I think that updating the filtering logic here for each case is going to become cumbersome.

Very interesting! The detector for this particular codemod is semgrep so that's also a little wrinkle.

sonarqubecloud · 2024-03-05T11:34:57Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
No data about Coverage
0.0% Duplication on New Code

See analysis details on SonarCloud

clavedeluna changed the title ~~Jwt sonar~~ jwt.decode sonar codemod Mar 2, 2024

clavedeluna marked this pull request as ready for review March 2, 2024 15:20

clavedeluna requested review from andrecsilva and drdavella as code owners March 2, 2024 15:20

clavedeluna commented Mar 2, 2024

View reviewed changes

andrecsilva approved these changes Mar 4, 2024

View reviewed changes

drdavella approved these changes Mar 4, 2024

View reviewed changes

clavedeluna added 7 commits March 5, 2024 08:34

split jwt decode codemod into transformer

957a2c0

add jwt sonar codemod

c2e915d

refactor to result check

7640b2d

document code path

a416a78

fix integration test

d19c6dc

add integration tests

dce788f

update based on review

a5cb40b

clavedeluna force-pushed the jwt-sonar branch from ac0555f to a5cb40b Compare March 5, 2024 11:34

clavedeluna enabled auto-merge March 5, 2024 11:34

clavedeluna added this pull request to the merge queue Mar 5, 2024

Merged via the queue into main with commit badb2da Mar 5, 2024

clavedeluna deleted the jwt-sonar branch March 5, 2024 12:16

Conversation

clavedeluna commented Mar 2, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview

Description

Additional Details

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

clavedeluna Mar 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

drdavella left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sonarqubecloud bot commented Mar 5, 2024

Quality Gate passed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

clavedeluna commented Mar 2, 2024 •

edited

Loading

clavedeluna Mar 4, 2024 •

edited

Loading