Fix `test_from_save_pretrained_dtype_inference`: the `model.to(dtype)` cast is not dtype-aware

I think an underlying issue with `test_from_save_pretrained_dtype_inference` is that the `model.to(dtype)` cast at

https://github.com/huggingface/diffusers/blob/5d10b4de3b65b1debb8a94de3a0e3bfa51fc628a/tests/models/testing_utils/common.py#L484

is not dtype-aware (it will cast everything to `dtype`), but `from_pretrained(..., torch_dtype=dtype)` is dtype-aware (it respects `_keep_in_fp32_modules`, etc.). This causes the behavior of the two to diverge in several scenarios:

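1. `_keep_in_fp32_modules` is specified on the `model` (the more common case), so `from_pretrained` keeps those modules in fp32 while `model.to(dtype)` casts them.
2. A non-persistent buffer like `inv_freq` is created with an explicit dtype (which is the case here), so `model.to(dtype)` casts it while the reloaded model keeps the dtype it was created with.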

which leads to a divergence between the outputs of `model` and `model_loaded`.
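
To make the divergence concrete, here is a minimal sketch in plain PyTorch. It is not the diffusers loading code: `TinyModel`, its `keep_in_fp32` attribute, and `dtype_aware_cast` are hypothetical stand-ins for a model with `_keep_in_fp32_modules` and for the dtype-aware path, under the assumption that the dtype-aware path only touches parameters and skips the listed modules.

```python
import torch
from torch import nn


class TinyModel(nn.Module):
    # Hypothetical stand-in for `_keep_in_fp32_modules`.
    keep_in_fp32 = ("norm",)

    def __init__(self, dim: int = 8):
        super().__init__()
        self.proj = nn.Linear(dim, dim)
        self.norm = nn.LayerNorm(dim)
        # Non-persistent buffer created with an explicit dtype, like `inv_freq`.
        exponent = torch.arange(0, dim, 2, dtype=torch.float32) / dim
        self.register_buffer("inv_freq", 1.0 / (10000**exponent), persistent=False)


def dtype_aware_cast(model: nn.Module, dtype: torch.dtype) -> nn.Module:
    """Cast parameters only, skipping modules listed in `keep_in_fp32`."""
    for name, param in model.named_parameters():
        if any(key in name.split(".") for key in model.keep_in_fp32):
            continue
        param.data = param.data.to(dtype)
    return model


dtype = torch.float16
blanket = TinyModel().to(dtype)               # casts every floating-point tensor
aware = dtype_aware_cast(TinyModel(), dtype)  # leaves `norm` and `inv_freq` in fp32

for label, m in (("blanket", blanket), ("aware", aware)):
    print(label, m.proj.weight.dtype, m.norm.weight.dtype, m.inv_freq.dtype)
# blanket torch.float16 torch.float16 torch.float16
# aware torch.float16 torch.float32 torch.float32
```

In this sketch the two models agree on `proj` but disagree on `norm` and `inv_freq`, which mirrors the two scenarios listed above.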

_Originally posted by @dg845 in https://github.com/huggingface/diffusers/pull/13862#discussion_r3359815645_

Cc: @dn6

Issue #13869