Add slots to dataclass schema #617

hramezani · 2023-05-19T16:11:46Z

This is for fixing the dataclass slots problem reported in pydantic/pydantic#5797

I've created this PR based on @samuelcolvin's suggestion.

Selected Reviewer: @dmontagu

codspeed-hq · 2023-05-19T16:16:02Z

CodSpeed Performance Report

Merging #617 dataclass_slots (559e0c6) will not alter performance.

Summary

🔥 0 improvements
❌ 0 regressions
✅ 120 untouched benchmarks

🆕 0 new benchmarks
⁉️ 0 dropped benchmarks

tests/test_schema_functions.py

adriangb · 2023-05-19T17:48:22Z

tests/validators/test_dataclasses.py

+@pytest.mark.skipif(sys.version_info < (3, 10), reason='slots are only supported for dataclasses in Python > 3.10')
+def test_slots() -> None:
+    kwargs = {'slots': True}


@hramezani please make this test pass. We should also add a test for a function validator stuck in between the DataclassSchema and DataclassArgsSchema for both before and after variants. See for example test_dataclass_field_before_validator.

adriangb · 2023-05-19T20:20:11Z

Approving as long as you get those tests passing 😀

samuelcolvin

Otherwise LGTM in principle. Let me know if you need help with the failing test.

src/validators/dataclass.rs

dmontagu · 2023-05-22T21:01:12Z

src/input/input_python.rs

-
-    fn is_exact_instance(&self, class: &PyType) -> bool {
-        self.get_type().is(class)
+    fn input_is_instance(&self, class: &PyType) -> Option<&PyAny> {


Not a big deal, but given this returns an Option<&PyAny> rather than a bool, the name seems a bit odd to me (I would expect it to return a bool, since the name sounds like an assertion). I'd suggest downcast or similar, but I'm also fine keeping it as is if you prefer.

src/serializers/shared.rs

dmontagu · 2023-05-22T21:26:51Z

src/validators/dataclass.rs

@@ -441,6 +442,17 @@ impl BuildValidator for DataclassValidator {
            None
        };

+        let slots = match schema.get_as::<&PyList>(intern!(py, "slots"))? {


Are we implicitly making the assumption that dataclasses are all-slots or no-slots-at-all? (Just want to make sure things work correctly since we just confirmed that's not the case, since you can add non-slots fields in subclasses of slots dataclasses.)

Seems to me if you always take the __dict__ (if present) and grab the __slots__, things should work, but of course the devil is in the implementation details.

dmontagu · 2023-05-22T21:27:15Z

src/validators/dataclass.rs

-        let new_dict = dict.copy()?;
+        let new_dict = if let Some(ref slots) = self.slots {
+            let slots_dict = PyDict::new(py);
+            for slot in slots {


Yeah, it looks like this implicitly assumes everything is a slot. I think if you start by trying to grab the __dict__ (ignoring it if that fails), then grab the slots one by one, it should be good. Alternatively, you might store a bool on the struct (or in a once_cell or whatever) indicating whether there are any non-slots fields in the schema; I guess that's probably the most performant way to modify this (short of trying to do something fancy based on class layout when slots exist).

dmontagu · 2023-05-22T21:28:06Z

src/validators/dataclass.rs

@@ -595,7 +624,14 @@ impl DataclassValidator {
        input: &'data impl Input<'data>,
    ) -> ValResult<'data, ()> {
        let (dc_dict, post_init_kwargs): (&PyAny, &PyAny) = val_output.extract(py)?;
-        force_setattr(py, dc, intern!(py, "__dict__"), dc_dict)?;
+        if self.slots.is_some() {


Similar issue here where you probably want to know which fields correspond to slots and which don't and handle appropriately.

I think there's a reasonable world where we tell people "if you use slots in your pydantic dataclasses, then all subclasses must be slots dataclasses, and all fields must have slots". But if we do that, then I think we need to tell people up front that their type has a problem, rather than delaying the issue to runtime validation, or worse, silently not fully initializing the objects.

But I think it would probably be preferable if we just handled mixed slots/__dict__ dataclasses properly automatically and didn't force that handling on users.


tests/validators/test_dataclasses.py

dmontagu · 2023-05-23T22:06:23Z

pydantic_core/core_schema.py

@@ -3132,6 +3134,7 @@ def dataclass_schema(
        metadata: Any other information you want to include with the schema, not used by pydantic-core
        serialization: Custom serialization schema
        frozen: Whether the dataclass is frozen
+        slots: The slots to use for the dataclass, set only if `slots=True` on the dataclass


Suggested change

-        slots: The slots to use for the dataclass, set only if `slots=True` on the dataclass
+        slots: The slots to use for the dataclass, set only if `slots=True` on the dataclass or one of its bases

src/serializers/type_serializers/dataclass.rs

src/validators/dataclass.rs

dmontagu

Noticed what I think is one more issue, related to validate_assignment when the dataclass has mixed slots and non-slots fields.

Also, sanity check — are we still using the slots field of the schema in a meaningful way? It seems that, other than the validate_assignment thing, we're just doing getattrs on all the fields. So maybe slots doesn't need to be tracked now? I'm not sure if that's right though. (Probably a good idea to track it for the sake of future performance improvements though.)

samuelcolvin · 2023-05-24T12:39:51Z

I think this is ready, please review.

samuelcolvin · 2023-05-25T08:05:28Z

I'm going to merge this as I need it to work on something else. But please feel free to give more feedback.

hramezani requested review from adriangb and dmontagu May 19, 2023 16:16

adriangb reviewed May 19, 2023

tests/test_schema_functions.py (outdated, resolved)

adriangb reviewed May 19, 2023

tests/test_schema_functions.py (outdated, resolved)

adriangb force-pushed the dataclass_slots branch 2 times, most recently from 7e79af7 to cfd877f Compare May 19, 2023 17:47

adriangb reviewed May 19, 2023

adriangb approved these changes May 19, 2023

hramezani and others added 5 commits May 22, 2023 15:37

Add slots to dataclass schema

04907a3

add test

c2dc0fd

Add test

83ac340

handle tests

d2623da

Fix for validation and revalidation

1081443

hramezani force-pushed the dataclass_slots branch from 5d38943 to 1081443 Compare May 22, 2023 12:07

Fix lint

2862361

samuelcolvin reviewed May 22, 2023

src/validators/dataclass.rs (outdated, resolved)

src/validators/dataclass.rs (outdated, resolved)

src/validators/dataclass.rs (outdated, resolved)

Skip tests

7daf210

hramezani force-pushed the dataclass_slots branch from dc53011 to 7daf210 Compare May 22, 2023 12:32

samuelcolvin added 2 commits May 22, 2023 21:45

fix dataclass support with slots, cleanup input

50d3e24

fix for python 3.11

00a9aaf

dmontagu reviewed May 22, 2023

src/serializers/shared.rs (outdated, resolved)

dmontagu reviewed May 22, 2023

samuelcolvin and others added 3 commits May 22, 2023 22:38

properly match dataclasses.fields logic

9eae5de

fix test_dataclass_classvar

c92e7da

Update the note about python versions under which dataclass slots are supported

66072e0

dmontagu reviewed May 22, 2023

tests/validators/test_dataclasses.py (resolved)

fix dataclass validation & serialization

a0826b1

dmontagu reviewed May 23, 2023

src/serializers/type_serializers/dataclass.rs (resolved)

dmontagu reviewed May 23, 2023

src/validators/dataclass.rs (resolved)

dmontagu requested changes May 23, 2023

pydantic-hooky bot added the awaiting author revision label May 23, 2023

pydantic-hooky bot assigned hramezani May 23, 2023

hramezani mentioned this pull request May 24, 2023

Update pydantic-core to 0.35.0 pydantic/pydantic#5846

Merged

add dataclass.fields to schema

559e0c6

pydantic-hooky bot added ready for review and removed awaiting author revision labels May 24, 2023

pydantic-hooky bot assigned samuelcolvin and unassigned hramezani May 24, 2023

samuelcolvin assigned dmontagu and unassigned samuelcolvin May 25, 2023

samuelcolvin merged commit 26fa27d into main May 25, 2023

samuelcolvin deleted the dataclass_slots branch May 25, 2023 08:05
