Add support to use config dtype in HybridChunkedCache #38908

vivekkhandelwal1 · 2025-06-19T10:56:43Z

What does this PR do?

This PR fixes the issue with the HybridChunkedCache initialisation since it does not consider the dtype present in the model config. Here:

transformers/src/transformers/cache_utils.py

Line 1806 in b949747

self._dtype = dtype

Signed-off-by: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>

vivekkhandelwal1 · 2025-06-19T10:58:42Z

Hi @ydshieh @Rocketknight1! It would be great if you can find some time to review this change.

Rocketknight1 · 2025-06-19T11:50:00Z

cc @gante

vivekkhandelwal1 · 2025-06-23T07:33:28Z

Hi @gante, can you please review this PR?

gante · 2025-06-23T15:25:32Z

Hi @vivekkhandelwal1

I don't think this is the right solution: config.torch_dtype may be out of sync with the model's actual dtype, e.g. when we manually cast a model with model.to(torch.XX) after loading it.

Would you be able to expand the description of the issue you're seeing? I might be able to provide solutions 🤗

Add support to use config dtype in HybridChunkedCache

cf41824

Signed-off-by: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add support to use config dtype in HybridChunkedCache #38908

Add support to use config dtype in HybridChunkedCache #38908

Uh oh!

vivekkhandelwal1 commented Jun 19, 2025 •

edited

Loading

Uh oh!

vivekkhandelwal1 commented Jun 19, 2025

Uh oh!

Rocketknight1 commented Jun 19, 2025

Uh oh!

vivekkhandelwal1 commented Jun 23, 2025

Uh oh!

gante commented Jun 23, 2025

Uh oh!

Uh oh!

Add support to use config dtype in HybridChunkedCache #38908

Are you sure you want to change the base?

Add support to use config dtype in HybridChunkedCache #38908

Uh oh!

Conversation

vivekkhandelwal1 commented Jun 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

vivekkhandelwal1 commented Jun 19, 2025

Uh oh!

Rocketknight1 commented Jun 19, 2025

Uh oh!

vivekkhandelwal1 commented Jun 23, 2025

Uh oh!

gante commented Jun 23, 2025

Uh oh!

Uh oh!

vivekkhandelwal1 commented Jun 19, 2025 •

edited

Loading