Add Error format support, and JSON output option #11396

tusharsadhwani · 2021-10-27T19:21:26Z

Description

Resolves #10816

The changes this PR makes are relatively small.
It currently:

Adds an --output option to the mypy CLI
Adds an ErrorFormatter abstract base class, which can be subclassed to create new output formats
Adds a MypyError class that represents the external format of a mypy error.
Adds a check for --output being 'json', in which case the JSONFormatter is used to produce the reported output.
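
To make the relationship between these pieces concrete, here is a minimal sketch. It is not the code from this PR: the report_error method name and the exact MypyError fields are assumptions, chosen only to match the keys shown in the demo output.

```python
import json
from abc import ABC, abstractmethod
from dataclasses import dataclass
from typing import Optional


@dataclass
class MypyError:
    """Illustrative stand-in for the external representation of one error."""

    file: str
    line: int
    column: int
    severity: str
    message: str
    code: Optional[str] = None


class ErrorFormatter(ABC):
    """Base class: one subclass per output format."""

    @abstractmethod
    def report_error(self, error: MypyError) -> str:
        """Return the text to print for a single error."""


class JSONFormatter(ErrorFormatter):
    def report_error(self, error: MypyError) -> str:
        # One JSON object per error, so each error fits on its own line.
        return json.dumps(
            {
                "file": error.file,
                "line": error.line,
                "column": error.column,
                "severity": error.severity,
                "message": error.message,
                "code": error.code,
            }
        )


example = MypyError("mytest.py", 3, 4, "error", 'Name "z" is not defined', "name-defined")
print(JSONFormatter().report_error(example))
```

Keeping the formatter behind an abstract method means a new format only has to implement one function per error.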

Demo:

$ mypy mytest.py              
mytest.py:2: error: Incompatible types in assignment (expression has type "str", variable has type "int")
mytest.py:3: error: Name "z" is not defined
Found 2 errors in 1 file (checked 1 source file)

$ mypy mytest.py --output=json
{"file": "mytest.py", "line": 2, "column": 4, "severity": "error", "message": "Incompatible types in assignment (expression has type \"str\", variable has type \"int\")", "code": "assignment"}
{"file": "mytest.py", "line": 3, "column": 4, "severity": "error", "message": "Name \"z\" is not defined", "code": "name-defined"}
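
Since the demo prints one JSON object per line, a consuming tool can parse the output line by line. A small sketch of that, where parse_mypy_json is a hypothetical helper name and the sample string is abbreviated from the demo above:

```python
import json
from typing import Iterator


def parse_mypy_json(output: str) -> Iterator[dict]:
    """Yield one dict per non-empty line of JSON-lines output."""
    for line in output.splitlines():
        line = line.strip()
        if line:
            yield json.loads(line)


# Sample text shaped like the demo output (first message shortened).
sample = (
    '{"file": "mytest.py", "line": 2, "column": 4, "severity": "error", '
    '"message": "Incompatible types in assignment", "code": "assignment"}\n'
    '{"file": "mytest.py", "line": 3, "column": 4, "severity": "error", '
    '"message": "Name \\"z\\" is not defined", "code": "name-defined"}\n'
)

for record in parse_mypy_json(sample):
    print(f'{record["file"]}:{record["line"]}: [{record["code"]}] {record["message"]}')
```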

A few notes regarding the changes:

I chose to re-use the intermediate ErrorTuples created during error reporting, instead of using the more general ErrorInfo class, because a lot of machinery already exists in mypy for sorting and removing duplicate error reports, which produces ErrorTuples at the end. The error sorting and duplicate removal logic could perhaps be separated out from the rest of the code, to be able to use ErrorInfo objects more freely.
ErrorFormatter doesn't really need to be an abstract class, but I think it would be better this way. If there's a different method that would be preferred, I'd be happy to know.
The --output CLI option is most probably not added in the correct place. The mypy option parsing code seems very complex, so any help with doing this properly would be appreciated.
The ability to add custom output formats can be simply added by subclassing the ErrorFormatter class inside a mypy plugin, and adding a name field to the formatters. The mypy runtime can then check through the __subclasses__ of the formatter and determine if such a formatter is present.
The "checking for the name field" part of this code might be appropriate to add within this PR itself, instead of hard-coding JSONFormatter. Does that sound like a good idea?
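
The lookup described in the last two notes could look roughly like the sketch below. Everything here is hypothetical: the name attribute, the find_formatter helper and the ShoutFormatter class (standing in for a formatter defined in a plugin) are illustrations, not part of this PR.

```python
import json
from abc import ABC, abstractmethod
from typing import List, Optional, Type


class ErrorFormatter(ABC):
    # Hypothetical attribute: the value a user would pass to --output.
    name: str = ""

    @abstractmethod
    def report_error(self, message: str) -> str:
        """Return the text to print for one error message."""


class JSONFormatter(ErrorFormatter):
    name = "json"

    def report_error(self, message: str) -> str:
        return json.dumps({"message": message})


class ShoutFormatter(ErrorFormatter):
    """Stands in for a formatter that a plugin might define."""

    name = "shout"

    def report_error(self, message: str) -> str:
        return message.upper()


def find_formatter(name: str) -> Optional[Type[ErrorFormatter]]:
    """Search all (direct and indirect) subclasses for a matching name."""
    stack: List[Type[ErrorFormatter]] = list(ErrorFormatter.__subclasses__())
    while stack:
        cls = stack.pop()
        if cls.name == name:
            return cls
        stack.extend(cls.__subclasses__())
    return None


formatter_cls = find_formatter("shout")
if formatter_cls is not None:
    print(formatter_cls().report_error("Name 'z' is not defined"))
```

One caveat of this approach: __subclasses__ only sees classes whose defining module has already been imported, so the plugin would have to be loaded before the lookup runs.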


intgr · 2021-10-28T14:06:37Z

mypy/error_formatter.py

+        return json.dumps({
+            'file': file,
+            'line': line,
+            'column': column,


line, column seems short-sighted.

Maybe startLine, startColumn, which would leave room to later add endLine, endColumn. This information is useful for IDEs to know what span to highlight.

tusharsadhwani · 2021-10-28

Yeah, good point.
But mypy currently only does line and column, and it might be very long before spans are added in. It could be argued that the change to startLine and startColumn can be made when the feature exists.

intgr · 2021-10-28

Changing the field names later will break tools though. And startLine, startColumn would already be accurate right now, because mypy currently points out the start of the span.

tusharsadhwani · 2022-01-19

I'll leave this convention for the separate --output=sarif format.

intgr · 2021-10-28T15:31:57Z

I'd like to make the case again for the existing JSON-based standard: Static Analysis Results Interchange Format (SARIF)

While that does not preclude support for a custom JSON format as well, perhaps if mypy were to support SARIF, there would be no need for a custom format?

Pros of SARIF:

There is no need to reinvent the wheel.
Can already integrate with existing tools like GitHub code scanning, Visual Studio, VS Code
Every IDE shouldn't have to implement its own mypy-specific integration. This is exemplified by the situation in IntelliJ IDEA/PyCharm: there are two 3rd party mypy-specific plugins, both of which have major problems that aren't getting fixed. [1]

Clearly the momentum isn't there to provide good mypy-specific IDE integrations. Adding mypy support for SARIF, and letting IDEs take care of parsing SARIF and implementing a good UI on top, seems like a far more sustainable option.

Cons:

SARIF is clearly infected with "design by committee": the data structures are verbose and deeply nested. However, advanced features are optional and a minimal implementation is straightforward; see the example below.
No streaming output; all results would have to be collected before the JSON output can be serialized.

[1] The plugin ratings are 3.4 and 2.9 out of 5 😖 https://plugins.jetbrains.com/search?search=mypy

Example minimal mypy SARIF output

{
  "version": "2.1.0",
  "$schema": "https://schemastore.azurewebsites.net/schemas/json/sarif-2.1.0.json",
  "runs": [
    {
      "tool": {
        "driver": {
          "name": "mypy",
          "version": "0.910"
        }
      },
      "results": [
        {
          "ruleId": "assignment",
          "level": "error",
          "message": {
            "text": "Incompatible types in assignment (expression has type \"str\", variable has type \"int\")"
          },
          "locations": [
            {
              "physicalLocation": {
                "artifactLocation": {
                  "uri": "mytest.py"
                },
                "region": {
                  "startLine": 2,
                  "startColumn": 4
                }
              }
            }
          ]
        },
        {
          "ruleId": "name-defined",
          "level": "error",
          "message": {
            "text": "Name \"z\" is not defined"
          },
          "locations": [
            {
              "physicalLocation": {
                "artifactLocation": {
                  "uri": "mytest.py"
                },
                "region": {
                  "startLine": 3,
                  "startColumn": 4
                }
              }
            }
          ]
        }
      ]
    }
  ]
}
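
As a rough illustration of how little is needed for the minimal document above, the sketch below builds the same structure from records shaped like the PR's JSON output. The to_sarif helper is hypothetical. It copies line and column through unchanged, as the example does; whether a column offset is needed is not settled in this thread. It also shows the "no streaming" point: every result has to be collected before the document can be serialized.

```python
import json
from typing import Dict, Iterable, List


def to_sarif(records: Iterable[Dict], tool_version: str) -> Dict:
    """Wrap per-error records in a minimal SARIF 2.1.0 document."""
    results: List[Dict] = []
    for rec in records:
        results.append(
            {
                "ruleId": rec["code"],
                "level": rec["severity"],
                "message": {"text": rec["message"]},
                "locations": [
                    {
                        "physicalLocation": {
                            "artifactLocation": {"uri": rec["file"]},
                            "region": {
                                "startLine": rec["line"],
                                "startColumn": rec["column"],
                            },
                        }
                    }
                ],
            }
        )
    return {
        "version": "2.1.0",
        "$schema": "https://schemastore.azurewebsites.net/schemas/json/sarif-2.1.0.json",
        "runs": [
            {
                "tool": {"driver": {"name": "mypy", "version": tool_version}},
                "results": results,
            }
        ],
    }


records = [
    {"file": "mytest.py", "line": 3, "column": 4, "severity": "error",
     "message": 'Name "z" is not defined', "code": "name-defined"},
]
print(json.dumps(to_sarif(records, "0.910"), indent=2))
```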

nvuillam · 2021-12-08T17:42:56Z

+1 for the SARIF format, let's use a common format instead of a custom mypy one! :)

tusharsadhwani · 2021-12-08T18:46:07Z

I'm willing to turn it into SARIF (basing it on the json snippet provided by @intgr), if that's what I need to get this into mypy 😄

@ethanhs @TH3CHARLie what do you think?

intgr · 2021-12-08T18:57:48Z

After sitting on it a little, I feel that there's room for more than one machine-readable format. The simpler json-line-based format is probably better for simple tools that only care about mypy. I'm willing to implement the SARIF part myself.

But I shouldn't be the one to decide that, I'm just an occasional lurker here. No mypy developers have chipped in yet.

nvuillam · 2021-12-08T19:07:58Z

I'm just a tourist here, but I'm currently activating SARIF output for all linters of MegaLinter, and having native SARIF output is a great benefit for linters ^^ (plus GitHub CodeQL natively understands the SARIF format ^^)

Some other Python linters already have SARIF output, like bandit. Maybe there is some code to reuse to manage the format?

tusharsadhwani · 2021-12-08T19:09:57Z

@nvuillam this PR introduces an ErrorFormatter class. By the time this PR is finalized, even if it doesn't use SARIF, you will probably be able to define your own ErrorFormatter class in a plugin and tell it to output the SARIF format. It'll be really easy.

Pylint has the same setup.

nvuillam · 2021-12-08T19:13:05Z

@tusharsadhwani thanks, but... I don't have the bandwidth to implement SARIF output on all linters that do not manage it yet 😅

intgr · 2021-12-08T19:47:19Z

I think one property that machine-readable formats should have is: if one error causes multiple lines of output, then that should appear as one result item rather than multiple.

So for example with mypy --show-error-context

_local/multi_error.py: note: In function "foo":
_local/multi_error.py:5:9: error: No overload variant of "get" of "Mapping" matches argument type "str"  [call-overload]
_local/multi_error.py:5:9: note: Possible overload variants:
_local/multi_error.py:5:9: note:     def get(self, key: Any) -> Any
_local/multi_error.py:5:9: note:     def [_T] get(self, key: Any, default: Union[Any, _T]) -> Union[Any, _T]
Found 1 error in 1 file (checked 1 source file)

should maybe be output as

{
  "file": "_local/multi_error.py",
  "line": 5,
  "column": 8,
  "severity": "error",
  "context": "In function \"foo\":",
  "message": "No overload variant of \"get\" of \"Mapping\" matches argument type \"str\"",
  "hint": "Possible overload variants:\n    def get(self, key: Any) -> Any\n    def [_T] get(self, key: Any, default: Union[Any, _T]) -> Union[Any, _T]",
  "code": "call-overload"
}

But again, maybe that shouldn't be a blocker for this PR; getting machine-readable output can be useful even when findings aren't accurately grouped.

The negative line/column numbers right now seem awkward though:

{"file": "_local/multi_error.py", "line": -1, "column": -1, "severity": "note", "message": "In function \"foo\":", "code": null}

tusharsadhwani · 2021-12-08T19:49:05Z

That's definitely a bug.

The mypy code that generates this output is ridiculously tightly coupled. I'll look into whether this can be fixed.

TomMD · 2022-01-14T20:46:44Z

It has been many months, with the project holding out for the best solution rather than accepting an existing one. Can we merge this and accept a future improvement via --output sarif if someone comes along and implements it?

tusharsadhwani · 2022-01-19T09:30:14Z

@intgr I did it properly this time. Hints should be fixed now.

sehyun-hwang · 2022-02-22T07:44:00Z

The --output json option gets ignored after some usage with mypy.api.run_dmypy. Passing the json option to the dmypy CLI worked fine, but invoking the run_dmypy function in a Python script returns plain string output after some usage. Can someone help me troubleshoot this?

tusharsadhwani · 2022-02-22T07:51:39Z

@sehyun-hwang sure thing. If you could provide a reproducible example I can look into it right away.

sehyun-hwang · 2022-02-23T01:59:15Z

@tusharsadhwani Thank you! I tried running the linting jobs in a container and was unable to reproduce it. The problem occurs only when my IDE invokes dmypy, so I suspect this has to do with concurrent execution. Is there a chance that the --output json option gets ignored when multiple clients are connected, or when clients are threaded or running in a child process?

sehyun-hwang · 2022-02-23T02:28:51Z

I found that the first file linted produces JSON output, but from the second one onward it goes back to the plain string format. Can you try to reproduce this?

centos@www /m/tax-automation (99-basic-types) [1]> dmypy check -- batch_api.py
{"file": "batch_api.py", "line": 5, "column": 0, "message": "Cannot find implementation or library stub for module named \"LAMBDA_CONFIG\"", "hint": "See https://mypy.readthedocs.io/en/stable/running_mypy.html#missing-imports", "code": "import"}
{"file": "batch_api.py", "line": 6, "column": 0, "message": "Cannot find implementation or library stub for module named \"common.utils\"", "hint": "", "code": "import"}
centos@www /m/tax-automation (99-basic-types) [1]> dmypy check -- foo.py
foo.py:2: error: Name "bar" is not defined
foo.py:5: error: Argument 1 to "foo" has incompatible type "str"; expected "int"

tusharsadhwani · 2022-02-23T06:30:32Z

@sehyun-hwang This seems like a dmypy bug. It seems that dmypy ignores cli flags or config file flags in certain cases.

Here's a file structure:

$ tree .
.
├── a.py
├── b.py
└── mypy.ini

And mypy.ini contains the following:

[mypy]
strict = True

a.py and b.py both contain:

def foo(): pass

This way mypy a.py won't show errors but mypy --strict a.py will.
When alternating between dmypy check a.py and dmypy check b.py, at some point dmypy stops showing strict output. For the same reason, it might also be forgetting that the --output flag was set.
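
The alternation described above could be scripted roughly as follows. This is only a sketch: alternating_checks and run_checks are hypothetical helper names, and run_checks is defined but not called here because it assumes dmypy is on PATH, a daemon is already running, and the a.py, b.py and mypy.ini files above exist in the working directory.

```python
import subprocess
from typing import List


def alternating_checks(files: List[str], rounds: int) -> List[List[str]]:
    """Build `dmypy check <file>` commands that cycle through the files."""
    return [["dmypy", "check", files[i % len(files)]] for i in range(rounds)]


def run_checks(commands: List[List[str]]) -> List[str]:
    """Run each command and collect its stdout for side-by-side comparison."""
    outputs = []
    for cmd in commands:
        proc = subprocess.run(cmd, capture_output=True, text=True)
        outputs.append(proc.stdout)
    return outputs


# Only print the commands; call run_checks(commands) in a prepared directory.
commands = alternating_checks(["a.py", "b.py"], rounds=6)
for cmd in commands:
    print(" ".join(cmd))
```

If the strict errors appear in the first outputs but vanish in later ones, that would match the behaviour described here.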

sehyun-hwang · 2022-02-23T07:18:16Z

#11396 (comment)

@intgr Could you take a look at the comment by @tusharsadhwani ?

intgr · 2022-02-23T10:10:52Z

I'm afraid I can't help much with this. Just to clarify, I think my name shows up in the "reviewers" list only because I left a code comment about this PR earlier. I'm not a maintainer or reviewer in an official capacity and I don't have much knowledge of mypy's internals.

I'm just a user interested in consuming structured output from mypy (and potentially adding SARIF support later on). That's why I shared my opinions in this comment thread.

github-actions · 2023-04-27T23:13:19Z

According to mypy_primer, this change has no effect on the checked open source code. 🤖🎉

github-actions · 2023-04-28T05:24:32Z

According to mypy_primer, this change has no effect on the checked open source code. 🤖🎉


tusharsadhwani · 2023-04-28T09:50:10Z

@sobolevn PR is ready for review! I've added tests, and taken care of code review comments by Ivan.

github-actions · 2023-05-05T05:15:11Z

Diff from mypy_primer, showing the effect of this PR on open source code:

pyinstrument (https://github.com/joerick/pyinstrument) got 1.58x faster (14.1s -> 8.9s)

hauntsaninja · 2023-05-09T20:33:04Z

mypy/errors.py

+
+        error_info = self.error_info_map[path]
+        if formatter is not None:
+            error_info = [info for info in error_info if not info.hidden]


Let's not duplicate this and instead pass error_tuples into format_messages

github-actions · 2023-05-10T07:07:30Z

According to mypy_primer, this change doesn't affect type check results on a corpus of open source code. ✅

hauntsaninja

I'm mostly okay with this. The one thing I'm a little concerned about is the additional layer of translation we're doing to this MypyError class.

Why can't we just emit the fields of the ErrorTuple basically as is? This keeps the mapping from normal mypy output to JSON mypy output trivial. Users interested in this can munge it however they see fit, e.g. I can imagine folks being interested in the output of reveal_type, which is of severity "note" and which this PR currently drops. Keeping additional translation here minimal reduces maintenance risk.

tusharsadhwani · 2023-05-12T07:38:56Z

@hauntsaninja my concern here was whether the ErrorTuple type can change and break compatibility.

If we plan to let people pass their own Formatter type through a plugin or something in the future (which I think is a great idea), then changing ErrorTuple will break all formatters.

I've myself seen ErrorTuple add two more fields in the last year, for end line and column. So that was the original concern: to not leak the internals out.

Even if we don't end up doing custom formatters, I think the decoupling is worth it for easier definition of new output formats.

hauntsaninja · 2023-05-13T06:18:27Z

Hmm, adding fields shouldn't be a break. I'm okay with doing a minor refactoring, e.g. at the least we should make it a NamedTuple. Overall, I think it's easier to maintain ErrorTuple than an additional translation layer. And if we do think we'd break ErrorTuple and we think that breaking it isn't acceptable, we could always add the decoupling from internals then.
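
A sketch of why a NamedTuple makes added fields less of a break. The field list here is illustrative only and is not mypy's actual ErrorTuple definition; describe is a hypothetical consumer.

```python
from typing import NamedTuple, Optional


class ErrorTuple(NamedTuple):
    # Illustrative fields, not mypy's real definition.
    file: Optional[str]
    line: int
    column: int
    severity: str
    message: str
    code: Optional[str] = None
    # Fields appended later get defaults, so older call sites keep working.
    end_line: int = -1
    end_column: int = -1


def describe(err: ErrorTuple) -> str:
    # Access by name does not depend on how many fields the tuple has.
    return f"{err.file}:{err.line}: {err.severity}: {err.message}"


old_style = ErrorTuple("a.py", 1, 0, "error", "boom")
print(describe(old_style))
```

Code that unpacks the tuple positionally would still break when fields are added; only access by attribute name stays stable.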

tusharsadhwani · 2023-05-13T07:14:57Z

~~Alright, I'll remove MypyError then.~~ Check message below, I don't think it's doable.

tusharsadhwani · 2023-05-13T07:49:13Z

@hauntsaninja I checked again, and now I remember why I did the MypyError thing: There's no direct relation between an ErrorTuple and a single error.

You can have 3 ErrorTuples like this:
("hint", -1, -1, "In class 'Foo':")
("hint", -1, -1, "On line 32:")
("error", 32, 4, "Mismatched return type, expected 'int', got 'str'")

Due to this, there's no direct way to turn ErrorTuples into JSON. You need to do some sort of processing, such as merging the hints, before you can output JSON.

So I don't think we can get rid of the translation from tuples to a different object.
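
The kind of processing meant here might look like the following sketch. It uses the simplified four-element tuples from the example above, not mypy's real ErrorTuple layout, and merge_hints is a hypothetical helper that attaches each run of "hint" tuples to the next non-hint tuple.

```python
from typing import Dict, List, Tuple

# (severity, line, column, message), as in the simplified example above.
RawTuple = Tuple[str, int, int, str]


def merge_hints(tuples: List[RawTuple]) -> List[Dict[str, object]]:
    """Fold each run of "hint" tuples into the error that follows it."""
    merged: List[Dict[str, object]] = []
    pending: List[str] = []
    for severity, line, column, message in tuples:
        if severity == "hint":
            pending.append(message)
            continue
        merged.append(
            {
                "severity": severity,
                "line": line,
                "column": column,
                "message": message,
                "hint": "\n".join(pending),
            }
        )
        pending = []
    # Hints with no following error are dropped in this sketch.
    return merged


raw = [
    ("hint", -1, -1, "In class 'Foo':"),
    ("hint", -1, -1, "On line 32:"),
    ("error", 32, 4, "Mismatched return type, expected 'int', got 'str'"),
]
print(merge_hints(raw))
```

Notes that follow an error, such as the overload variants shown earlier in this thread, would need a second pass and are not handled here.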

github-actions · 2023-05-13T08:11:46Z

Diff from mypy_primer, showing the effect of this PR on open source code:

dacite (https://github.com/konradhalas/dacite) got 3.06x faster (8.3s -> 2.7s)

AngellusMortis · 2023-07-06T23:00:56Z

Is there anything still blocking this PR from being merged?

tusharsadhwani · 2023-07-07T08:14:23Z

@AngellusMortis #15273 is all

sabiroid · 2023-12-28T22:24:40Z

@hauntsaninja, is anything still blocking this PR, or is it good to go?

8tm · 2024-02-06T22:52:09Z

New year, new changes, new merge?
I just wanted to inform you that developers are waiting and eager for new features 😄

tusharsadhwani added 7 commits October 24, 2021 19:54

Add -O/--output CLI option

393820c

Initial formatter setup

282bd28

Make error_formatter an optional argument

ccda5b0

Fix type annotation

c849a77

Fix whitespace

fd2feab

Remove whitespace

b188001

Merge branch 'master' of https://github.com/tusharsadhwani/mypy into …

51c1acc

…output-json

intgr reviewed Oct 28, 2021


tushar-deepsource and others added 3 commits January 19, 2022 13:01

Merge branch 'python:master' into output-json

9177dab

Add hint property to errors

bc5ceac

Fix lint issues

9d29ab0

97littleleaf11 requested a review from JukkaL February 22, 2022 07:46

tusharsadhwani added 2 commits April 28, 2023 04:04

unused import

7a3f736

trailing whitespace

6d46f75

try fixing windows

e71a372

tusharsadhwani and others added 4 commits April 28, 2023 11:43

fix windows separator issue

8cca203

[pre-commit.ci] auto fixes from pre-commit.com hooks

79e16a8

for more information, see https://pre-commit.ci

unused import

8bf4890

Merge branch 'master' into output-json

5899f26

Merge branch 'master' into output-json

880b8f3

hauntsaninja reviewed May 9, 2023


hauntsaninja added the upnext label May 9, 2023

tusharsadhwani added 2 commits May 10, 2023 12:18

Pass error tuples to format_messages

0aafadf

Merge branch 'master' into output-json

4cab249

tusharsadhwani requested a review from hauntsaninja May 10, 2023 07:26

hauntsaninja reviewed May 12, 2023


Merge branch 'master' into output-json

7fe71c3

hauntsaninja mentioned this pull request May 21, 2023

Improve multiline reporting #15273

Open
