Support flexible TypedDict creation/update #15425

ilevkivskyi · 2023-06-12T23:53:49Z

Fixes #9408
Fixes #4122
Fixes #6462
Supersedes #13353

This PR enables two similar technically unsafe behaviors for TypedDicts, as @JukkaL explained in #6462 (comment) allowing an "incomplete" TypedDict as an argument to .update() is technically unsafe (and a similar argument applies to ** syntax in TypedDict literals). These are however very common patterns (judging from number of duplicates to above issues), so I think we should support them. Here is what I propose:

Always support cases that are safe (like passing the type itself to update)
Allow popular but technically unsafe cases by default
Have a new flag (as part of --strict) to fall back to current behavior

Note that unfortunately we can't use just a custom new error code, since we need to conditionally tweak some types in a plugin. Btw there are couple TODOs I add here:

First is for unsafe behavior for repeated TypedDict keys. This is not new, I just noticed it when working on this
Second is for tricky corner case involving multiple ** items where we may have false-negatives in strict mode.

Note that I don't test all the possible combinations here (since the phase space is huge), but I think I am testing all main ingredients (and I will be glad to add more if needed):

All syntax variants for TypedDicts creation are handled
Various shadowing/overrides scenarios
Required vs non-required keys handling
Union types (both as item and target types)
Inference for generic TypedDicts
New strictness flag

More than half of the tests I took from the original PR #13353 by @eflorico

ilevkivskyi · 2023-06-13T08:27:02Z

OK, so about the few new errors in mypy_primer:

Auto-Split: two new errors are correct, previously not-shown because inferred type from previous error was Any. Btw @hauntsaninja that repo is archived, it looks like development is now at https://github.com/Toufool/AutoSplit
pydantic: Previously error code was misc, so now type ignores stopped working. Not sure what is the policy on this, I guess this is OK. Also maybe we can allow **Any and **dict[Any, Any] in TypedDict constructor? We probably can still give an error about missing required keys.

JelleZijlstra

Thanks! I didn't look at the code in detail, but I do think the Any cases should be allowed. Any means it could be anything, so as long as the runtime type could be something legal, we shouldn't show an error if you use Any.

JelleZijlstra · 2023-06-14T02:59:00Z

test-data/unit/check-typeddict.test

+    pass
+
+foo: Dict[str, Any] = {}
+bar: Bar = {**foo}  # E: Unsupported type "Dict[str, Any]" for ** expansion in TypedDict


I feel this should be allowed; Any means Any.

I agree with Ivan that for consistency we can only this if the type is Dict[Any, Any]. However, I'm not sure if Dict[Any, Any] is supported in other contexts (need to double check).

FWIW as far as I know Dict[Any, Any] is not well supported in other TypedDict contexts. I decided to allow Dict[Any, Any] only because I found an example in mypy_primer where Dict[Any, Any] is used specifically in ** context. I think we can allow Dict[Any, Any] in other contexts, e.g. it can be a (non-proper) subtype of all non-total TypedDicts.

JelleZijlstra · 2023-06-14T03:03:11Z

test-data/unit/check-typeddict.test

+x: Any
+y: Dict[Any, Any]
+z: Union[Any, Dict[Any, Any]]
+t1: Foo = {**x}  # E: Missing key "a" for TypedDict "Foo"


I feel these should all be allowed.

I don't have a strong opinion, but a reasonable first step would be to do the same we do with Dict[Any, Any] in other TypedDict contexts, and if we decide to make things more flexible, we should do consistently.

hauntsaninja · 2023-06-14T03:12:15Z

For what it's worth, the behaviour with Any for this kind of thing was intentional, see #4976 (comment)
pyright's behaviour also matches mypy's here.

ilevkivskyi · 2023-06-14T10:42:11Z

Just to clarify, there are two question/decisions here: First question is should we allow:

**Any -- IMO definitely yes
**dict[Any, Any] -- I think probably yes. My motivation is this is kind of like allowing a: Any; k: Literal["key"] = a (and we allow this)
**dict[str, Any] -- IMO definitely no, because this is like allowing s: str; k: Literal["key"] = s. str is not like Any in the world of literals, it is like object in the world of literals.

Second question is whether we should still warn about missing required keys in cases where we allow Any types. I think probably yes, my motivation is this is kind of like we always prohibit

a: Any
k: Final = "key"
k = a

This is fine from the purely type comparison point of view (because of Any), but not semantically correct.

ilevkivskyi · 2023-06-18T23:10:36Z

Are there any other comments on this? (Also besides the Any type behavior)

hauntsaninja

For strict update, can we also allow in the case that the argument TypedDict is final? Similar situation to #7981 / microsoft/pyright#1899 (comment)

hauntsaninja · 2023-06-19T23:33:37Z

test-data/unit/check-typeddict.test

+[case testTypedDictUnpackSame]
+from typing import TypedDict


nit: might as well test this one with strict. Probably some of the other test cases with no errors could be tested with strict as well, like testTypedDictUnpackCompatible, testTypedDictUnpackMultiple

Suggested change

[case testTypedDictUnpackSame]

from typing import TypedDict

[case testTypedDictUnpackSame]

# flags: --strict-typeddict-update

from typing import TypedDict

ilevkivskyi · 2023-06-22T22:15:34Z

@hauntsaninja

For strict update, can we also allow in the case that the argument TypedDict is final?

I think this is kind of a separate question, first we need a more general support for final TypedDicts, i.e. first #7981 needs to be closed, most importantly a TypedDict with extra keys should not be a subtype of a final TypedDict. Once this is done, yes, we can allow final TypedDicts even in strict mode.

JukkaL · 2023-06-23T13:18:42Z

I think that it's reasonable to let users enable the old (safe) behavior using a command-line flag, but adding a new flag doesn't seem worth it. I'd suggest adding a new flag that would cover both --strict-typeddict-update and --strict-concatenate, and we'd make --strict-concatenate a deprecated flag that isn't shown in --help. The new flag could be called --pedantic or --extra-checks, for example. We can later add more safety checks that are technically correct, will probably not find many real issues, but can cause friction, behind this flag. Some other flags, such as --strict-equality, should probably not be merged into this flag, since even though it has false positives, it often finds real issues.

JukkaL

Thanks! These are very useful and often requested improvements. Some details are still open, but they can be refined later on after this has been merged.

JukkaL · 2023-06-23T15:57:55Z

test-data/unit/check-typeddict.test

+    pass
+
+foo: Dict[str, Any] = {}
+bar: Bar = {**foo}  # E: Unsupported type "Dict[str, Any]" for ** expansion in TypedDict


I agree with Ivan that for consistency we can only this if the type is Dict[Any, Any]. However, I'm not sure if Dict[Any, Any] is supported in other contexts (need to double check).

JukkaL · 2023-06-23T15:58:55Z

test-data/unit/check-typeddict.test

+x: Any
+y: Dict[Any, Any]
+z: Union[Any, Dict[Any, Any]]
+t1: Foo = {**x}  # E: Missing key "a" for TypedDict "Foo"


I don't have a strong opinion, but a reasonable first step would be to do the same we do with Dict[Any, Any] in other TypedDict contexts, and if we decide to make things more flexible, we should do consistently.

github-actions · 2023-06-25T23:09:26Z

Diff from mypy_primer, showing the effect of this PR on open source code:

AutoSplit (https://github.com/Toufool/AutoSplit)
- src/user_profile.py:118: error: Expected keyword arguments, {...}, or dict(...) in TypedDict constructor  [misc]
+ src/user_profile.py:133: error: TypedDict key must be a string literal; expected one of ("split_hotkey", "reset_hotkey", "undo_split_hotkey", "skip_split_hotkey", "pause_hotkey", ...)  [literal-required]
+ src/user_profile.py:137: error: TypedDict key must be a string literal; expected one of ("split_hotkey", "reset_hotkey", "undo_split_hotkey", "skip_split_hotkey", "pause_hotkey", ...)  [literal-required]

graphql-core (https://github.com/graphql-python/graphql-core): typechecking got 1.05x slower (361.3s -> 380.5s)
(Performance measurements are based on a single noisy sample)

optuna (https://github.com/optuna/optuna)
+ Warning: --strict-concatenate is deprecated; use --extra-checks instead

pydantic (https://github.com/samuelcolvin/pydantic)
+ pydantic/v1/networks.py:237: error: Unsupported type "dict[str, str]" for ** expansion in TypedDict  [typeddict-item]
+ pydantic/v1/networks.py:237: note: Error code "typeddict-item" not covered by "type: ignore" comment

hydra-zen (https://github.com/mit-ll-responsible-ai/hydra-zen)
- src/hydra_zen/wrapper/_implementations.py:1472: error: Argument 1 to "update" of "TypedDict" has incompatible type "_StoreCallSig"; expected "TypedDict({'name'?: str | Callable[[Any], str], 'group'?: str | None | Callable[[Any], str | None], 'package'?: str | Callable[[Any], str] | None, 'provider'?: str | None, '__kw'?: dict[str, Any], 'to_config'?: Callable[[Any], Any]})"  [typeddict-item]

eflorico · 2023-07-02T19:23:35Z

Thank you for your work @ilevkivskyi, this is really exciting! I'm glad I could help a little bit through my earlier tests 😊

LarsDu · 2023-07-06T08:03:28Z

Thanks for this @ilevkivskyi. I was shocked this was only fixed so recently

ilevkivskyi added 5 commits June 11, 2023 21:50

Start working

82f330a

Cleanups; update union handling; more tests

b8b4fa7

Add inference test case

7925535

Add another test case from issue

1aa4218

Handle strictnes also for star items

57f57c4

ilevkivskyi requested review from JelleZijlstra, JukkaL and hauntsaninja June 12, 2023 23:53

This comment has been minimized.

Sign in to view

ilevkivskyi added 2 commits June 13, 2023 10:26

Allow Any in star unpacks; add one more union test

4d2f961

Update test

7b0ecb5

This comment has been minimized.

Sign in to view

Support plain dict syntax as well

d6ed5cf

This comment has been minimized.

Sign in to view

JelleZijlstra reviewed Jun 14, 2023

View reviewed changes

hauntsaninja reviewed Jun 19, 2023

View reviewed changes

hauntsaninja approved these changes Jun 19, 2023

View reviewed changes

ilevkivskyi added 2 commits June 22, 2023 23:16

Merge remote-tracking branch 'upstream/master' into support-start-td

dbb36fb

Address CR

98cdc48

This comment has been minimized.

Sign in to view

JukkaL approved these changes Jun 23, 2023

View reviewed changes

ilevkivskyi added 2 commits June 23, 2023 23:03

Merge remote-tracking branch 'upstream/master' into support-start-td

2664bd2

Merge flags and deprecate old one

a98f555

This comment has been minimized.

Sign in to view

Add docs for new flag

12fcb4f

ilevkivskyi merged commit 8290bb8 into python:master Jun 26, 2023

ilevkivskyi deleted the support-start-td branch June 26, 2023 23:26

ilevkivskyi mentioned this pull request Jun 26, 2023

Allow unpacking of TypedDict into TypedDict #13353

Closed

cdce8p mentioned this pull request Jun 27, 2023

Remove --extra-checks from strict mode #15532

Closed

A5rocks mentioned this pull request Jul 18, 2023

TypedDict 'in' narrowing w/o @final #15697

Open

hauntsaninja mentioned this pull request Oct 30, 2024

Reconsider --extra-checks #18070

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support flexible TypedDict creation/update #15425

Support flexible TypedDict creation/update #15425

ilevkivskyi commented Jun 12, 2023

This comment has been minimized.

ilevkivskyi commented Jun 13, 2023

This comment has been minimized.

This comment has been minimized.

JelleZijlstra left a comment

JelleZijlstra Jun 14, 2023

JukkaL Jun 23, 2023

ilevkivskyi Jun 23, 2023

JelleZijlstra Jun 14, 2023

JukkaL Jun 23, 2023

hauntsaninja commented Jun 14, 2023 •

edited

Loading

ilevkivskyi commented Jun 14, 2023

ilevkivskyi commented Jun 18, 2023

hauntsaninja left a comment

hauntsaninja Jun 19, 2023

ilevkivskyi commented Jun 22, 2023

This comment has been minimized.

JukkaL commented Jun 23, 2023

JukkaL left a comment

JukkaL Jun 23, 2023

JukkaL Jun 23, 2023

This comment has been minimized.

github-actions bot commented Jun 25, 2023

eflorico commented Jul 2, 2023

LarsDu commented Jul 6, 2023

Support flexible TypedDict creation/update #15425

Support flexible TypedDict creation/update #15425

Conversation

ilevkivskyi commented Jun 12, 2023

This comment has been minimized.

ilevkivskyi commented Jun 13, 2023

This comment has been minimized.

This comment has been minimized.

JelleZijlstra left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

hauntsaninja commented Jun 14, 2023 • edited Loading

ilevkivskyi commented Jun 14, 2023

ilevkivskyi commented Jun 18, 2023

hauntsaninja left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ilevkivskyi commented Jun 22, 2023

This comment has been minimized.

JukkaL commented Jun 23, 2023

JukkaL left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

This comment has been minimized.

github-actions bot commented Jun 25, 2023

eflorico commented Jul 2, 2023

LarsDu commented Jul 6, 2023

hauntsaninja commented Jun 14, 2023 •

edited

Loading