feat: Computed field serialization for TypedDict #1018

michaelhly · 2023-10-15T09:31:24Z

Change Summary

Updated computed field serialization to use the serialization function directly (if one exists) instead of always looking up a model attribute.

Related issue number

fix #657

Checklist

Unit tests for the changes exist
Documentation reflects the changes where applicable
Pydantic tests pass with this pydantic-core (except for expected changes)
My PR is ready to review, please add a comment including the phrase "please review" to assign reviewers

Selected Reviewer: @samuelcolvin

codecov · 2023-10-15T09:35:16Z

Codecov Report

Merging #1018 (a4f0ca1) into main (23d1065) will increase coverage by 0.03%.
The diff coverage is 100.00%.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1018      +/-   ##
==========================================
+ Coverage   93.12%   93.15%   +0.03%     
==========================================
  Files         106      106              
  Lines       15952    15994      +42     
  Branches       35       35              
==========================================
+ Hits        14855    14899      +44     
+ Misses       1090     1088       -2     
  Partials        7        7

Files	Coverage Δ
src/serializers/computed_fields.rs	`97.04% <100.00%> (+0.97%)`	⬆️

... and 2 files with indirect coverage changes

Continue to review full report in Codecov by Sentry.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 23d1065...a4f0ca1. Read the comment docs.

codspeed-hq · 2023-10-15T09:40:29Z

CodSpeed Performance Report

Merging #1018 will improve performances by 13.01%

_{Comparing michaelhly:computed-field-ser-func (a4f0ca1) with main (23d1065)}

Summary

⚡ 1 improvements
✅ 139 untouched benchmarks

Benchmarks breakdown

	Benchmark	`main`	`michaelhly:computed-field-ser-func`	Change
⚡	`test_generator_rust`	35.3 µs	31.2 µs	+13.01%

michaelhly · 2023-10-15T10:25:00Z

please review

davidhewitt

Looks generally like a fine approach, with a couple of small thoughts. I wonder how this looks when applied to a TypedDict? I'm unsure if it's actually valid to have methods on a TypedDict?

src/serializers/computed_fields.rs

tests/serializers/test_model.py

michaelhly · 2023-10-25T18:27:04Z

Looks generally like a fine approach, with a couple of small thoughts. I wonder how this looks when applied to a TypedDict? I'm unsure if it's actually valid to have methods on a TypedDict?

We compute based on the serialization function provided in the schema instead of adding a method on the TypedDict model. Here is the corresponding unit test for the described case:
147e917

adriangb · 2023-10-25T19:43:51Z

I'm unsure if it's actually valid to have methods on a TypedDict?

At runtime yes. But type checkers hate it (insert meme reference).

davidhewitt

👍 so I think then in principle we can allow these fields on TypedDict and users just have to type: ignore them?

davidhewitt · 2023-10-26T11:10:38Z

src/serializers/computed_fields.rs

+    // Backwards compatiability.
+    let mut legacy_attr_error: Option<PyErr> = None;
+    let legacy_result = match ob_type_lookup.get_type(input_value) {


I don't think it's fair to call this "backwards compatibility" when this is still expected to be the main code path for models and dataclasses.

I wonder if there might be a more unified way. For a model or dataclass A with computed field b, the analogous functionality really seems to be A.b.__get__(instance). For a TypedDict it looks like that also works:

>>> class Bar(TypedDict): ... @property ... def y(self): ... return 434 ... >>> Bar.y.__get__({}) 434

So maybe what we really want, in all cases, is

let property_value = input_value.get_type().getattr(field.property_name_py.as_ref(py))?.call_method1("__get__", (input_value,))?;

My thinking was that a serialization function should be provided to a computed field, doesn't matter if the input type is a Model, TypedDict, Dataclass, etc.

The default behavior is to compute the computed value from the function provided in the serialization schema and then it gets set in the output_dict:

pydantic-core/src/serializers/computed_fields.rs

Lines 144 to 154 in 866eb2d

let value = self

.serializer

.to_python(next_value, next_include, next_exclude, extra)?;

if extra.exclude_none && value.is_none(py) {

return Ok(());

}

let key = match extra.by_alias {

true => self.alias_py.as_ref(py),

false => property_name_py,

};

output_dict.set_item(key, value)?;

This seems more generalizable to all computed fields instead of relying on the computed field defined as an attribute on the input value.

However, I am probably missing some context on how computed fields are used by https://github.com/pydantic/pydantic.

I'm not following all that closely but my 2c is that ideally we extract the function from the thing in pydantic and not in pydantic-core so that:

We have more flexibility. It's easier to hack things (like rebuild __mro__ based on __orig_bases__ which we do for TypedDict)

It ensures that we do this at schema build time and not runtime

The con of that last one is that in theory someone could want us to use the method on a subclass they pass in as a value, which doesn't apply to TypedDict but also is not what we do for BaseModel and no one has complained 😄

feat: Serialize computed field without a model

0929f62

michaelhly changed the title ~~feat: Serialize computed field without a model~~ feat: Serialize computed field for dictionaries types Oct 15, 2023

michaelhly changed the title ~~feat: Serialize computed field for dictionaries types~~ feat: Computed field serialization for TypedDicts Oct 15, 2023

michaelhly changed the title ~~feat: Computed field serialization for TypedDicts~~ feat: Computed field serialization for TypedDict Oct 15, 2023

michaelhly marked this pull request as ready for review October 15, 2023 10:24

pydantic-hooky bot added the ready for review label Oct 15, 2023

pydantic-hooky bot assigned samuelcolvin Oct 15, 2023

Check if model is a dict

a1bba74

michaelhly force-pushed the computed-field-ser-func branch from caa7cf9 to a1bba74 Compare October 15, 2023 10:27

clean up

4d830d7

michaelhly force-pushed the computed-field-ser-func branch 3 times, most recently from e7d7a9a to 8525d28 Compare October 15, 2023 11:28

Fix

264e6ac

michaelhly force-pushed the computed-field-ser-func branch from 8525d28 to 264e6ac Compare October 15, 2023 11:45

michaelhly added 3 commits October 15, 2023 07:56

Fallback behavior for ObType::Unknown

747dd3e

Update tests

d995188

Refactor get_next_value

fd5521c

michaelhly force-pushed the computed-field-ser-func branch from 54203aa to 4aa13c4 Compare October 15, 2023 13:16

Lint

6dc985f

michaelhly force-pushed the computed-field-ser-func branch from 4aa13c4 to 6dc985f Compare October 15, 2023 14:25

davidhewitt reviewed Oct 25, 2023

View reviewed changes

src/serializers/computed_fields.rs Outdated Show resolved Hide resolved

tests/serializers/test_model.py Outdated Show resolved Hide resolved

michaelhly added 3 commits October 25, 2023 13:54

Address ser_schema comment

d293215

Add test to compute on TypedDict model

147e917

Address comment on error message

bdc1326

Small clean up

144120f

Merge remote-tracking branch 'origin/main' into computed-field-ser-func

a4f0ca1

michaelhly requested a review from davidhewitt October 25, 2023 18:41

davidhewitt reviewed Oct 26, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Computed field serialization for TypedDict #1018

feat: Computed field serialization for TypedDict #1018

michaelhly commented Oct 15, 2023 •

edited

codecov bot commented Oct 15, 2023 •

edited

codspeed-hq bot commented Oct 15, 2023 •

edited

michaelhly commented Oct 15, 2023

davidhewitt left a comment

michaelhly commented Oct 25, 2023

adriangb commented Oct 25, 2023

davidhewitt left a comment

davidhewitt Oct 26, 2023

michaelhly Oct 26, 2023 •

edited

adriangb Oct 26, 2023 •

edited

	let value = self
	.serializer
	.to_python(next_value, next_include, next_exclude, extra)?;
	if extra.exclude_none && value.is_none(py) {
	return Ok(());
	}
	let key = match extra.by_alias {
	true => self.alias_py.as_ref(py),
	false => property_name_py,
	};
	output_dict.set_item(key, value)?;

feat: Computed field serialization for TypedDict #1018

Are you sure you want to change the base?

feat: Computed field serialization for TypedDict #1018

Conversation

michaelhly commented Oct 15, 2023 • edited

Change Summary

Related issue number

Checklist

codecov bot commented Oct 15, 2023 • edited

Codecov Report

codspeed-hq bot commented Oct 15, 2023 • edited

CodSpeed Performance Report

Merging #1018 will improve performances by 13.01%

Summary

Benchmarks breakdown

michaelhly commented Oct 15, 2023

davidhewitt left a comment

Choose a reason for hiding this comment

michaelhly commented Oct 25, 2023

adriangb commented Oct 25, 2023

davidhewitt left a comment

Choose a reason for hiding this comment

davidhewitt Oct 26, 2023

Choose a reason for hiding this comment

michaelhly Oct 26, 2023 • edited

Choose a reason for hiding this comment

adriangb Oct 26, 2023 • edited

Choose a reason for hiding this comment

michaelhly commented Oct 15, 2023 •

edited

codecov bot commented Oct 15, 2023 •

edited

codspeed-hq bot commented Oct 15, 2023 •

edited

michaelhly Oct 26, 2023 •

edited

adriangb Oct 26, 2023 •

edited