Add flag to allow more flexible variable redefinition #18727

JukkaL · 2025-02-24T14:17:46Z

Infer union types for simple variables from multiple assignments, if the variable isn't annotated. The feature is enabled via --allow-redefinition-new. --local-partial-types must also be enabled.

This is still experimental and has known issues, so it's not documented anywhere. It works well enough that it can be used for non-trivial experimentation, however.

Closes #6233. Closes #6232. Closes #18568. Fixes #18619.

In this example, the type of x is inferred as int | str when using the new behavior:

def f(i: int, s: str) -> int | str:
    if i > 5:
        x = i
    else:
        x = s  # No longer an error
    reveal_type(x)  # int | str
    return s

Here is a summary of how it works:

An assignment can widen the inferred type of a variable, and it always narrows the variable's current type to the type of the assigned value (when there is no annotation).
Simple variable lvalues are put into the binder on initial assignment when using the new feature. We need to be able to track whether a variable is defined or not to infer correct types (see #18619, "Binder loses narrowed type of a variable if variable may be uninitialized").
Assignment of None values is no longer special, and we don't use partial None types for simple variables if the feature is enabled.
Lvalues other than simple variables (e.g. self.x) continue to work as in the past. Attribute types can't be widened, since they are externally visible and widening could cause confusion, but this is something we might relax in the future. Globals can be widened, however. This seems necessary for consistency.
If a loop body widens a variable type, we have to analyze the body again. However, we only do one extra pass, since otherwise the inferred type could keep expanding without bound (consider x = 0 outside the loop and x = [x] within the loop body).
We first infer the type of an rvalue without using the lvalue type as context, as otherwise the type context would often prevent redefinition. If the rvalue type isn't valid for inference (e.g. list item type can't be inferred), we fall back to the lvalue type context.
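
The rules above can be sketched with a few small functions. This is only an illustration, not code from the PR or its test suite: the function names are hypothetical, and the types mentioned in the comments are what the description above suggests mypy would infer when run with --allow-redefinition-new --local-partial-types. They have not been checked against a specific mypy version.

```python
def widen_and_narrow(flag: bool) -> str:
    x = 0                 # x starts out as int
    if flag:
        x = "text"        # widens the inferred type; x is narrowed to str here
    # After the branches merge, x is expected to be int | str.
    return str(x)


def none_is_not_special(key: str) -> int:
    result = None         # no partial None type under the new behavior
    if key:
        result = len(key)  # widens the inferred type to include int
    # Expected here: int | None.
    return -1 if result is None else result


def loop_widening(n: int) -> object:
    x = 0
    for _ in range(n):
        x = [x]           # every iteration nests the value one level deeper
    # Per the description, the loop body is analyzed only one extra time,
    # so the inferred type does not keep growing.
    return x
```

The loop case is the one where a fixed number of passes matters: the runtime value can nest arbitrarily deep, so no finite union of list types describes it exactly.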

There are some other known bugs and limitations:

Annotated variables can't be freely redefined (but they can still be narrowed, of course). I may want to relax this in the future, but I'm not sure yet.
If there is a function definition between assignments to a variable, the inferred types may be incorrect.
There are few tests for nonlocal and some other features. We don't have good test coverage for deferrals, the mypy daemon, and disabling strict optional.
Imported names can't be redefined in a consistent way. This needs further analysis.
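
The first two limitations can be illustrated with a small sketch. The function names are hypothetical and the comments describe the behavior as stated in the list above, assuming --allow-redefinition-new --local-partial-types. None of this has been verified against a particular mypy build.

```python
def annotated(flag: bool) -> int:
    x: int | str = 0      # annotated: the declared type stays int | str
    if flag:
        x = "text"        # fine, str is part of the declared type
    # x = 1.5             # would still be an error: float is outside int | str
    # Narrowing within the declared type keeps working as usual.
    return len(x) if isinstance(x, str) else x


def with_nested_function() -> str:
    x = 0
    def show() -> str:
        # A function definition sits between the two assignments to x.
        # The PR lists this as a case where inferred types may be incorrect.
        return repr(x)
    x = "text"
    return show()
```

At runtime the nested function sees the final value of x, which is why the type inferred at the point of definition can disagree with what the function actually observes.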

In mypy's self check, the feature generates 6 additional errors, all of which seem correct: we infer more precise types, which produces additional errors due to invariant containers and fixed false negatives.

When type checking the largest internal codebase at Dropbox, this generated about 700 new errors, the vast majority of which seemed legitimate. Mostly they were due to inferring more precise types for variables that used to have Any types. I used a recent but not the latest version of the feature to type check the internal codebase.

This lets us see the impact on mypy primer.

JukkaL · 2025-03-12T17:17:40Z

Temporarily enabled --allow-redefinition-new by default. I also had to enable --local-partial-types. Note that both of these are expected to generate new errors.

JukkaL · 2025-03-14T15:40:02Z

There are still some false positives in the mypy_primer output that I want to fix before merging, but I won't attempt to fix every single issue in this PR, since the new behavior is behind a flag.

ilevkivskyi · 2025-03-14T16:00:48Z

@JukkaL

> There are still some false positives in the mypy_primer output that I want to fix before merging, but I won't attempt to fix every single issue in this PR, since the new behavior is behind a flag.

Sure, there is no point in turning this into a mega-PR. Let me know if/when you want me to review this PR again.

hauntsaninja · 2025-03-14T23:29:04Z

Btw if you want a clean mypy_primer diff on the local partial types front, you can try something like:

diff --git a/.github/workflows/mypy_primer.yml b/.github/workflows/mypy_primer.yml
index ee8684847..a8220e1b6 100644
--- a/.github/workflows/mypy_primer.yml
+++ b/.github/workflows/mypy_primer.yml
@@ -66,6 +66,7 @@ jobs:
             --num-shards 5 --shard-index ${{ matrix.shard-index }} \
             --debug \
             --additional-flags="--debug-serialize" \
+            --additional-flags="--local-partial-types" \
             --output concise \
             | tee diff_${{ matrix.shard-index }}.txt
           ) || [ $? -eq 1 ]

JukkaL · 2025-03-17T14:06:49Z

@hauntsaninja

> Btw if you want a clean mypy_primer diff on the local partial types front, you can try something like: ...

Since the PR also changes how None types are inferred, this actually generates a larger diff. I'm not sure if there is a good way to isolate the changes caused by the flag in mypy primer.

This reverts commit 24afddd.

JukkaL · 2025-03-17T14:10:23Z

@ilevkivskyi I'm not planning any further changes before merging. I'd like to merge this in a few days. It would be great if you can review my recent changes.

…l-types" This reverts commit 69fd839.

github-actions · 2025-03-17T14:37:21Z

According to mypy_primer, this change doesn't affect type check results on a corpus of open source code. ✅

ilevkivskyi

LG, thanks!

jorenham · 2025-03-19T11:33:40Z

Thanks @JukkaL !

The feature was introduced in #18727.

JukkaL added 30 commits February 24, 2025 13:52

WIP some initial prototyping

9735391

WIP add failing test case

946d4d5

WIP minimal support for merging control flow

7f7e25f

Add globals test case

0c9f049

Add class body test case

9935537

Require --local-partial-types

fd12f85

Fix optional types

b489b6a

Add partial type test cases

d2b7558

Pass options consistently

63bf0af

Fix interaction with Final

ec02654

Add annotated variable test case

4290857

Add test

5c8ab17

Add failing test

c116451

Only use type context in assignment if inference fails without type c…

9794308

…ontext

Always use type annotation as context

a355473

Add while loop test case

8162a45

Fix type inference in loops

b1f75a7

Update tests

62dfbb0

Remove underscore special case

bb6a246

Don't perform renaming when using new semantics

51c61c7

Fix for loops

0384795

WIP failing tests

a003c83

Add try/except test case

a35870e

Add match statement test

dea9148

Add simple nested function test case

2b104dc

Update globals tests case

6a05baf

Add assignment expression test case

b58be6a

Add lambda test case

5b31950

Tests for imports

bcaace1

Add operator assignment test case

ad90125

JukkaL added 2 commits March 12, 2025 16:29

Address more comments

2c6a144

Add TODO comments

daf9c75