-
-
Notifications
You must be signed in to change notification settings - Fork 2.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Types not added to binder on initial assignment #2008
Comments
(I came across this issue while working on #1955.) |
I also noticed this while working on the binder and wondered whether it was intentional. It seems unlikely that any real code relies on the current behavior, but as you note, probably most of the binder-related tests do, if they are to test what was intended. I wonder what a good replacement for the tests is, especially in a post-strict-optional world. Maybe add a generic function T = TypeVar('T')
def something() -> T: pass to the test fixtures and then replace initializations like x = 1 # type: Union[int, str] by x = something() # type: Union[int, str] |
Making this change is pretty easy, but it causes 2 new errors when running mypy on mypy. class Super: pass
class Sub(Super): pass
def f() -> List[Super]:
x = Sub() # type: Super
ret = [x]
return ret # E: Incompatible return value type (got List[Sub], expected List[Super]) This can be worked around by giving The second boils down to this: y = [1]
x = y # type: List[Any]
x.append("foo") # E: Argument 1 to "append" of "list" has incompatible type "str"; expected "int" This one should arguably be a Maybe the conclusion is that we should only do this if the specified type is a Union? |
Just doing this for unions sounds reasonable to me. More generally, this would likely be useful for conditional definitions, and unions are commonly used for conditional definitions. For non-unions, this could be reasonable, but it would be much harder to support without breaking the examples given above:
|
I had a long diatribe about this but changed my mind several times while writing, so I guess it's complicated. However I don't see how changing this would break the last Example Jukka gave? |
If we only make this change for unions, it would likely not break a lot of code. My example doesn't work right now, I think, so it's already a problem (but likely not a major one, so we can continue ignoring it). |
Oooh, we definitely need to talk!
…--Guido (mobile)
|
But the proposed change would fix that example wouldn't it? |
Yes. |
(If we'd always adds types to binder, this would fix my example. If we'd only do this for union types then it wouldn't fix it.) |
So then why shouldn't we just fix it the way David originally proposed,
always adding the type to the binder?
|
Adding it always to the binder breaks some other things, such ones mentioned by David in #2008 (comment). There are other things it could break. For example, what if we want to make a collection not support mutation:
Not being able to override the annotated type breaks some pretty basic use cases, and it can be surprising. For union types it is less of a problem, as they are usually used in very specific ways, and they are relatively rare. All in all this looks like a pretty complex issue to me (though the implementation would be straightforward). |
But the above code is already broken if you write it slightly differently: x = None # type: Sequence[int]
x = [1, 2]
x.append(3) # Passes |
This is marked for the 0.4.x milestone, but we can't even decide on whether to do it. Should we move it out? Or make a decision? Personally I'd rather just do it (for all types, not just for unions) and wait for possible fallout. |
Doing it for all types would be the consistent thing to do, and experimenting with this sounds reasonable. We could update mypy documentation to mention this explicitly and describe a workaround, such as using a cast. Example workaround:
Assignments in the class body with a |
If we are going to change this, it's better to it early rather than later, as this will break some existing code. |
OK, then let's do it! |
Off-line, @ddfisher still objects, but it's possible that the problem is not so much with this (tiny) binder change, but with the issues brought up in an earlier comment. These have to do with invariance, where |
To be more specific, Guido has largely convinced me that the binder/inference behavior of the initial assignment and subsequent lines should be the same. However, now I'm not sure that our current behavior on subsequent lines is correct -- no other language that I know about will infer a more specific type based on assignments. I think the main reason we do this currently is for Unions, which feels natural in Python because it's dynamic and because Unions are unwrapped as opposed to living in an ADT. I think we'll have to take a look at a bunch of potential examples to decide what's best. |
Yeah, my counterexample would go something like this: class Shape:
def set_color(self, color: float): ...
class Circle(Shape):
def set_radius(self, r: float): ...
class Square(Shape):
def set_side(self, side: float): ...
def random_shape() -> Shape:
shape: Shape # PEP 526 syntax
if random.random() < 0.5:
shape = Square()
shape.set_side(random.random())
else:
shape = Circle()
shape.set_radius(random.random())
shape.set_color(random.random())
return shape |
Here's another example -- I remember seeing code like this:
|
Another example:
|
Decreasing the priority of this since the current behavior seems to be mostly fine and I haven't heard many complaints about this in a while. I'm actually leaning towards just keeping the current behavior. In Python 3.6 code the binder behavior can be more easily tweaked by annotating the variable before initialization: x: Optional[int]
x = 1
reveal_type(x) # int |
I've started a discussion about this in typing-sig. |
In the typing-sig discussion, @erictraut argued convincingly that type checkers should follow what PEP 526 says, and Pyre's @shannonzhu agreed, so I think we have to follow suit. It should make no difference whether the assignment is combined with the declaration or separate. IOW, we should implement Proposal 3. (PS. Everyone also agreed that narrowing should not occur if the declared type is |
There can be some unintentional? edge cases with special cased types such as from typing import Protocol, Self, overload, SupportsIndex
class SupportsSlicing[T_co](Protocol):
@overload
def __getitem__(self, index: SupportsIndex, /) -> T_co: ...
@overload
def __getitem__(self, index: slice, /) -> Self: ...
lst: list[int] = [1,2,3,4]
x: SupportsSlicing[int] = lst # ✅ OK
tup: tuple[int, ...] = (1, 2, 3, 4)
y: SupportsSlicing[int] = tup # ❌ type error This is because the inferred type is |
This means if you want to declare a union and then immediately use it as one specific member, you'll have an awkward time. For example:
To be consistent with other assignments, we should add types to the binder on initial assignment. I think a number of Union/Optional tests depend on the current behavior, so those will need to be updated.
The text was updated successfully, but these errors were encountered: