Fix type-safety of attribute access in `BaseModel` #8651

bluenote10 · 2024-01-27T08:58:44Z

Change Summary

This PR fixes the type-safety of attribute access of BaseModel.

I noticed that in addition to __setattr__, __delattr__ should get the same treatment. Otherwise the same issue would arise with del model.non_existing_field (albeit deleting fields is probably a bit strange to begin with).

Related issue number

fix #8643

Checklist

The pull request title is a good summary of the changes - it will be used in the changelog
Unit tests for the changes exist
Tests pass on CI
Documentation reflects the changes where applicable
My PR is ready to review, please add a comment including the phrase "please review" to assign reviewers

Regarding test: Are you familiar with the notion of type tests via # type: ignore?

I wanted to add a such a type test to make sure there cannot be a regression again. The idea is to leverage mypy's warn_unused_ignores feature, which you have already enabled as far as I can see. The nice thing about the feature is that # type: ignores are only allowed if and only if a particular line does not type check. This enables library authors to verify their interface in terms of typing guarantees. What I wanted to add is something like this to the test_main.py:

if typing.TYPE_CHECKING:
    # The following test(s) are purely type-checking-time tests; no runtime execution needed.

    def test_typesafety_of_attributes():

        class Model(BaseModel):
            foo: int

        model = Model(foo=42)

        model.non_existing_field  # type: ignore[attr-defined]
        model.non_existing_field = ...  # type: ignore[attr-defined]
        del model.non_existing_field  # type: ignore[attr-defined]

Checking this snippet with mypy would fail on the main branch, because the last two lines actually don't produce type errors. After the change in main.py they do behave correctly and the type ignores become mandatory as one would expect. Therefore the snippet could guarantee that __getattr__/__setattr__/__delattr__ don't accidentally leak towards the type checker again.

Unfortunately it looks like the tests are not typed checked though, i.e., the tests does something only when I do mypy tests/test_main.py explicitly, but running just make seems to ignore all type errors in the test.

Could you advice on how to proceed with the test?

Wouldn't it make sense to type check the tests as well? Currently they seem to have quite a lot of type errors, so this would be a bit of work, but could add some value. In general it is always nice to see that a test that does something with pytest.raises actually does require some # type: ignore, because it indicates that the runtime behavior and the type-checker time behavior are in line! So by forcing oneself to place these # type: ignore on these "expected errors", it gets pretty clear how well the typings match with actual behavior. What do you think?

Selected Reviewer: @adriangb

bluenote10 · 2024-01-27T08:58:58Z

please review

codspeed-hq · 2024-01-27T09:02:34Z

CodSpeed Performance Report

Merging #8651 will not alter performance

_{Comparing bluenote10:fix_type_safety_of_attribute_access (e43bf96) with main (b785d5b)}

Summary

✅ 10 untouched benchmarks

adriangb · 2024-01-27T14:17:12Z

Personally, I like type-checking tests but that would be a big change on this repo that should be discussed separately.

Maybe we can add this to the mypy or pyright tests?
https://github.com/pydantic/pydantic/tree/main/tests/pyright
https://github.com/pydantic/pydantic/tree/main/tests/mypy

It's a bit hard to tell from the diff but I think all you did was indent these functions into a TYPE_IGNORE right? There were no changes to the function bodies?

bluenote10 · 2024-01-27T17:12:09Z

It's a bit hard to tell from the diff but I think all you did was indent these functions into a TYPE_IGNORE right? There were no changes to the function bodies?

Yes, no other changes, just the indentation of the two methods. Unfortunately, GitHub's diff algorithm is a bit suboptimal, and makes the diff look worse than it is (it associates some else: statements with different else: statements because they happen to have the same indentation after the indentation change). To verify that it is purely a whitespace change, you can enable it here in the review:

Maybe we can add this to the mypy or pyright tests?

I will look into that to see where it would fit.

bluenote10 · 2024-01-28T12:01:06Z

Maybe we can add this to the mypy or pyright tests?

I had a closer look now, but it feels like this is not quite the right place for it, no? On first glance these seem more like "meta tests" that are responsible for the execution of the type checkers, managing versions and handling results. It is not quite the right place to actually type check certain aspects of API interface like BaseModel. That would be better suited to be placed alongside the actual (runtime) tests of these API interfaces, which are the normal test modules like test_main.py in the case of BaseModel.

Also, the modules in tests/mypy/tests/pyright do not seem to be type checked, so just moving the snippet there wouldn't work out-of-the-box either I guess.

adriangb · 2024-01-28T12:34:38Z

@samuelcolvin how would you feel about enabling type-checking for tests and using includes/excludes to either (1) exclude all current files or (2) exclude all files and whitelist only some files (I'm not sure this is possible).

bluenote10 · 2024-01-28T16:23:39Z

(2) exclude all files and whitelist only some files (I'm not sure this is possible).

This should be possible in general. At least I've successfully used something like this in mypy.ini's (pseudo-code, didn't look up the exact syntax):

# blacklist tests
[mypy-tests.*]
ignore_errors = True

# whitelist specific modules/packages
[mypy-tests.test_some_module.py]
ignore_errors = False
[mypy-tests.core.*]
ignore_errors = False

I'm not quite sure though if mypy's filtering requires proper "package" semantics to work (which may require placing __init__.py inside the tests folder structure, which is a subtle topic).

samuelcolvin · 2024-01-29T15:10:09Z

I really don't want to add type checking to tests.

We have mypy tests, let's use them and if they're not sufficient, improve them.

samuelcolvin · 2024-01-29T15:20:10Z

😕

Using # type: ignore in general purpose tests and thereby causing those tests to perform two quite separate tasks implicitly is a recipe for a mess.

I'm no fan of "everything should be type hinted, everything should be type checked" absolutism, it's a wrong-headed as the "I don't need type hints" crowd.

What we should do however is significantly simply our "mypy tests" and convert them to "static-typing tests", they should be made up of:

static_type_tests/mypy/*.py - small files which should all pass mypy with "no unused ignores" enabled
static_type_tests/pyright/*.py - small files which should all pass pyright with "no unused ignores" enabled
static_type_tests/all/*.py - which should be tested with both

All those files should all be imported and not raise an error (via a parameteriesd test), thereby confirming their runtime behaviour is correct.

adriangb · 2024-01-29T16:05:37Z

Okay then let’s add relnotes and merge this without adding type checking tests

bluenote10 · 2024-01-29T18:14:58Z

Of course, I have not familiar enough with the existing code and test infrastructure to really judge what is best suited, but just a few thoughts on this.

Using # type: ignore in general purpose tests and thereby causing those tests to perform two quite separate tasks

The point is that runtime behavior and type-checking semantics are not necessarily two separate tasks. Even the contrary: It can be easier to look at them jointly. For instance, if you look at a typical test that requires a # type: ignore like

    class UltraSimpleModel(BaseModel):
        a: float

    m = UltraSimpleModel(a=10.2)

    with pytest.raises(ValueError):
        m.c = 20 # type: ignore[attr-defined]

it can be quite satisfying to see that the pytest.raises exactly matches up with a # type: ignore. This indicates that the runtime behavior and the type checking semantics are perfectly in line.

[...] is a recipe for a mess

For what it's worth, we have experimented with both approaches, and have made good experiences with it. Seeing the alignment of runtime behavior with the type checking semantics can be quite valuable. For instances, we have discovered cases, where a pytest.raises was missing a corresponding # type: ignore indicating that the type system wasn't modeling the runtime behavior properly (and e.g. introducing an @overload would fix that), or vice versa. I probably should have written the test case as:

def test_attribute_semantics():
    class Model(BaseModel):
        foo: int

    model = Model(foo=42)

    with pytest.raises(AttributeError):
        model.non_existing_field  # type: ignore[attr-defined]
    with pytest.raises(AttributeError):
        model.non_existing_field = ...  # type: ignore[attr-defined]
    with pytest.raises(AttributeError):
        del model.non_existing_field  # type: ignore[attr-defined]

On the main branch, mypy would have complained about unused type ignores in two of the three cases, highlighting the discrepancy between runtime and time system.

I'm no fan of "everything should be type hinted, everything should be type checked" absolutism

Me neither, and that's not what I wanted to suggest. I rather had a lightweight mypy profile in mind. For instance, I'd always advocate for disallow_untyped_defs = False in tests, because it is pointless having the use -> None on all the test functions.

What we should do however is significantly simply our "mypy tests" and convert them to "static-typing tests"

This could cause some duplication because often the same code makes sense as a runtime check and as a type system check, like the cases above.

All those files should all be imported and not raise an error (via a parameteriesd test), thereby confirming their runtime behaviour is correct.

I may be misunderstanding the idea, but testing for "does not type check" typically exactly implies "does raise an error at runtime", no? Doesn't that lead to the need for wrappers similar to pytest.raises again?

bluenote10 · 2024-01-29T18:26:09Z

Okay then let’s add relnotes and merge this without adding type checking tests

@adriangb I couldn't figure out what exactly I have to do. If it refers to the GitHub labels I don't seem to have the rights to add one. Or should I write something in a change log somewhere?

fix type-safety of attribute access

e43bf96

pydantic-hooky bot added the ready for review label Jan 27, 2024

pydantic-hooky bot assigned adriangb Jan 27, 2024

adriangb added relnotes-fix Used for bugfixes. and removed ready for review labels Jan 29, 2024

adriangb enabled auto-merge (squash) January 29, 2024 21:15

adriangb approved these changes Jan 30, 2024

View reviewed changes

adriangb merged commit c1dff15 into pydantic:main Jan 30, 2024
55 of 57 checks passed

Viicos mentioned this pull request Mar 15, 2024

mypy doesn't complain about non-existent attributes when setting #9017

Closed

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix type-safety of attribute access in `BaseModel` #8651

Fix type-safety of attribute access in `BaseModel` #8651

bluenote10 commented Jan 27, 2024 •

edited

bluenote10 commented Jan 27, 2024

codspeed-hq bot commented Jan 27, 2024

adriangb commented Jan 27, 2024

bluenote10 commented Jan 27, 2024

bluenote10 commented Jan 28, 2024

adriangb commented Jan 28, 2024

bluenote10 commented Jan 28, 2024

samuelcolvin commented Jan 29, 2024

samuelcolvin commented Jan 29, 2024

adriangb commented Jan 29, 2024

bluenote10 commented Jan 29, 2024

bluenote10 commented Jan 29, 2024

Fix type-safety of attribute access in BaseModel #8651

Fix type-safety of attribute access in BaseModel #8651

Conversation

bluenote10 commented Jan 27, 2024 • edited

Change Summary

Related issue number

Checklist

bluenote10 commented Jan 27, 2024

codspeed-hq bot commented Jan 27, 2024

CodSpeed Performance Report

Merging #8651 will not alter performance

Summary

adriangb commented Jan 27, 2024

bluenote10 commented Jan 27, 2024

bluenote10 commented Jan 28, 2024

adriangb commented Jan 28, 2024

bluenote10 commented Jan 28, 2024

samuelcolvin commented Jan 29, 2024

samuelcolvin commented Jan 29, 2024

adriangb commented Jan 29, 2024

bluenote10 commented Jan 29, 2024

bluenote10 commented Jan 29, 2024

Fix type-safety of attribute access in `BaseModel` #8651

Fix type-safety of attribute access in `BaseModel` #8651

bluenote10 commented Jan 27, 2024 •

edited