Add public BaseModel.iter() method #997

Bobronium · 2019-11-14T22:23:15Z

Feature Request

Add BaseModel.iter() method which will behave exactly like a dict, but without converting nested values to dict by default and will be a generator instead

Why

Imagine you want to iterate through values in model, but also need to exclude some, or use aliases instead of attributes. So what you need to do is write something like this:

for attr, value in model.dict(exclude={...}, by_alias=True).items():
    do_something(attr, value)

Obviously, in this case python will firet evaluate model.dict(exclude={...}, by_alias=True). part, going through all values and only then you will be able to iterate, going through the same values again.

Also, by using dict nested values are force-converted to dict, and this might be very slow when we just want to iterate through top-level values.

We also cannot use existing _iter(), because some work is actually done in dict()

How we can solve it

By having iter() method we can bypass this double-iteration problem and converting-to-dict problems:

for attr, value in model.iter(exclude={...}, by_alias=True):
    do_something(attr, value)

Selected Assignee: @hramezani

The text was updated successfully, but these errors were encountered:

samuelcolvin · 2019-11-14T22:27:03Z

This already exists!

It's called ._iter().

samuelcolvin · 2019-11-14T22:27:39Z

Should be documented I guess.

Bobronium · 2019-11-14T22:28:53Z

This already exists!
It's called ._iter().

Yeah, but I mentioned, why it cannot be used in this case:

We also cannot use existing _iter(), because some work is actually done in dict()

By some work i mean resolving aliases keys and excluding the values

samuelcolvin · 2019-11-14T22:33:34Z

I don't want to add more public method to models, best I think to implement this yourself.

If there's a simple way to achieve this with changes to existing functions of happy review a PR.

samuelcolvin · 2019-11-14T22:34:27Z

Sorry, I missed your comment when replying in a hurry.

Bobronium · 2019-11-14T22:46:03Z

I don't want to add more public method to models, best I think to implement this yourself.

Just to be clear, thats valid even if ._iter() itself would become public without adding any new methods?

If there's a simple way to achieve this with changes to existing functions of happy review a PR.

I guess the simplest way would be to move those excluding and aliases stuff from .dict() to ._iter(), that will also simplify .dict() too.

I'll try to play with it soon and will be happy to make a PR once it'll be done.

dmontagu · 2019-11-14T22:50:58Z

For what it's worth, I would be in favor of moving logic from dict into _iter, assuming it didn't result in meaningful performance penalties in any public methods -- I think it would simplify/result in a smaller diff for the implementation of dump_as_type from #812, and the related serialization performance improvements (even when not dumping as a specified type).

Kludex · 2023-04-25T17:43:59Z

On V2, we can leverage .model_dump() as follows:

from pydantic import BaseModel


class Foo(BaseModel):
    a: str
    b: int


foo = Foo(a="a", b=1)

for key, value in foo.model_dump().items():
    print(key, value)

For that reason, I'll be closing this issue. 🙏

gsakkis · 2023-08-21T18:43:21Z

@Kludex model_dump is not equivalent to _iter if there are fields with custom serialization. Here's an example for a bson.ObjectId field serialized as string.

import bson
from pydantic import BaseModel, Field
from pydantic_core import core_schema


class PydanticObjectId(bson.ObjectId):
    @classmethod
    def __get_pydantic_core_schema__(cls, source_type, handler):
        serialization = core_schema.plain_serializer_function_ser_schema(str)
        return core_schema.is_instance_schema(cls, serialization=serialization)


class Foo(BaseModel):
    id: PydanticObjectId = Field(default_factory=PydanticObjectId)
    b: int


foo = Foo(b=1)
print(f"str: {foo}")
print(f"_iter: {dict(foo._iter())}")
print(f"model_dump: {foo.model_dump()}")

Output:

str: id=ObjectId('64e3afb883f5e2aaed388877') b=1
_iter: {'id': ObjectId('64e3afb883f5e2aaed388877'), 'b': 1}
model_dump: {'id': '64e3afb883f5e2aaed388877', 'b': 1}

So unless there is a way to prevent custom serialization, I'd suggest to reopen this issue.

samuelcolvin · 2023-09-15T14:05:13Z

no longer really relevant in V2 - _iter has gone, replaced by the underlying methods in pydantic-core.

gsakkis · 2023-09-15T14:21:52Z

An example of how the previous example would work by using pydantic-core methods instead of _iter would help.

bojanbg · 2023-11-23T09:12:06Z

@Kludex model_dump is not equivalent to _iter if there are fields with custom serialization. Here's an example for a bson.ObjectId field serialized as string.

So unless there is a way to prevent custom serialization, I'd suggest to reopen this issue.

I think there is a problem with your PydanticObjectId definition. If I use the one from here, this is the output I get running your example:

str: id=ObjectId('655f16f55d40bae3a225723f') b=1
_iter: {'id': ObjectId('655f16f55d40bae3a225723f'), 'b': 1}
model_dump: {'id': ObjectId('655f16f55d40bae3a225723f'), 'b': 1}

thusithaC · 2024-02-19T04:23:45Z

So what was the final verdict? Can we use model_dump instead of _iter?

Bobronium added the feature request label Nov 14, 2019

samuelcolvin closed this as completed Nov 14, 2019

Bobronium changed the title ~~Add BaseModel.iter() method~~ Add public BaseModel.iter() method Nov 14, 2019

samuelcolvin reopened this Nov 14, 2019

samuelcolvin mentioned this issue Nov 15, 2019

Move all public methods on a model into a single namespace #1001

Closed

Bobronium mentioned this issue Nov 21, 2019

Refactor ._iter() method, 10x speed boost for dict(model) #1017

Merged

4 tasks

samuelcolvin added the dumping how pydantic serialises models, e.g. via `.dict()` and `.json()` label Jan 3, 2020

samuelcolvin mentioned this issue Jan 3, 2020

add exclude_private parameter #1139

Closed

4 tasks

Kludex closed this as completed Apr 25, 2023

Kludex reopened this Aug 22, 2023

pydantic-hooky bot assigned hramezani Aug 22, 2023

pydantic-hooky bot added the unconfirmed Bug not yet confirmed as valid/applicable label Aug 22, 2023

samuelcolvin closed this as completed Sep 15, 2023

alexdrydew pushed a commit to alexdrydew/pydantic that referenced this issue Dec 23, 2023

Add benchmark for nested/wide model using definitions (pydantic#997)

49126b0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add public BaseModel.iter() method #997

Add public BaseModel.iter() method #997

Bobronium commented Nov 14, 2019 •

edited by pydantic-hooky bot

samuelcolvin commented Nov 14, 2019

samuelcolvin commented Nov 14, 2019

Bobronium commented Nov 14, 2019 •

edited

samuelcolvin commented Nov 14, 2019

samuelcolvin commented Nov 14, 2019

Bobronium commented Nov 14, 2019

dmontagu commented Nov 14, 2019 •

edited

Kludex commented Apr 25, 2023

gsakkis commented Aug 21, 2023

samuelcolvin commented Sep 15, 2023

gsakkis commented Sep 15, 2023

bojanbg commented Nov 23, 2023

thusithaC commented Feb 19, 2024

Add public BaseModel.iter() method #997

Add public BaseModel.iter() method #997

Comments

Bobronium commented Nov 14, 2019 • edited by pydantic-hooky bot

Feature Request

Why

How we can solve it

samuelcolvin commented Nov 14, 2019

samuelcolvin commented Nov 14, 2019

Bobronium commented Nov 14, 2019 • edited

samuelcolvin commented Nov 14, 2019

samuelcolvin commented Nov 14, 2019

Bobronium commented Nov 14, 2019

dmontagu commented Nov 14, 2019 • edited

Kludex commented Apr 25, 2023

gsakkis commented Aug 21, 2023

samuelcolvin commented Sep 15, 2023

gsakkis commented Sep 15, 2023

bojanbg commented Nov 23, 2023

thusithaC commented Feb 19, 2024

Bobronium commented Nov 14, 2019 •

edited by pydantic-hooky bot

Bobronium commented Nov 14, 2019 •

edited

dmontagu commented Nov 14, 2019 •

edited