[torchao] fix safetensors for sharding #28169
Conversation
Force-pushed 1189efe to 91a71f1
@pytest.mark.skipif(not TORCHAO_AVAILABLE, reason="torchao is not available")
@pytest.mark.skip(
Do we need to skip now that torch 2.9 is released?
I think we still need to skip, since this test depends on changes in torchao main. cc @jerryzh168
Yeah, we need a separate update to upgrade the torchao version used in testing in vLLM:
Line 616 in da786e3
uv pip install --system torchao==0.13.0
and make sure the tests are passing
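One way to avoid an unconditional `@pytest.mark.skip` while waiting for that upgrade is to gate the test on the installed torchao version, so it re-enables itself once CI moves past the pinned `torchao==0.13.0`. The sketch below is illustrative only (the minimum version `"0.14.0"` is a hypothetical placeholder, not a value from this PR):

```python
# Sketch, not vLLM's actual CI code: skip the test unless the installed
# torchao is at least a (hypothetical) minimum version.
from importlib.metadata import PackageNotFoundError, version

import pytest


def parse_version(v: str) -> tuple[int, ...]:
    """Parse leading numeric components, e.g. "0.14.0.dev0" -> (0, 14, 0)."""
    parts: list[int] = []
    for piece in v.split("."):
        if piece.isdigit():
            parts.append(int(piece))
        else:
            break  # stop at pre-release suffixes like "dev0"
    return tuple(parts)


def torchao_at_least(minimum: str) -> bool:
    try:
        return parse_version(version("torchao")) >= parse_version(minimum)
    except PackageNotFoundError:
        return False


# Hypothetical minimum version that would contain the required torchao changes.
requires_new_torchao = pytest.mark.skipif(
    not torchao_at_least("0.14.0"),
    reason="test depends on changes in torchao main",
)
```

Applying `@requires_new_torchao` instead of `@pytest.mark.skip(...)` means no follow-up PR is needed to un-skip the test after the version bump.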
Force-pushed a27f7af to 428de94
heheda12345 left a comment
LGTM!
Head branch was pushed to by a user without write access
Force-pushed 3de2e14 to ffcf29c
Force-pushed ffcf29c to 458a632
@heheda12345 can you pls merge? thanks!
Force-pushed 458a632 to 97726c0
Of course! I'm retrying the failed test.
Head branch was pushed to by a user without write access
Force-pushed 97726c0 to fda63f2
This pull request has merge conflicts that must be resolved before it can be merged.
Force-pushed fda63f2 to c181744
Signed-off-by: Angel Li <liangel@meta.com>
Force-pushed c181744 to a0c7cff
@heheda12345 a rebase seems to have fixed CI, can you help me merge? Thanks!
Merged. Thanks!
Summary

Previously, our safetensors implementation for tensor subclasses didn't consider the possibility of different tensor attributes for one instance of a tensor subclass (i.e. `qdata` or `scale` for `Float8Tensor`) being sharded to different files. This PR handles that case by:

- in `unflatten_tensor_state_dict`, we delete the corresponding tensor attributes (i.e. `qdata`, `scale`) from the original state dict passed in

Testing

`pytest tests/quantization/test_torchao.py::test_safetensors_model_loading_with_params -s`
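The sharding-aware behavior described in the summary can be sketched as follows. All names here (the flattened `"name:attr"` key scheme, the `metadata` mapping, the dict-of-attributes result) are hypothetical illustrations of the idea, not torchao's or vLLM's actual implementation:

```python
# Sketch: reassemble tensor-subclass entries whose attributes (e.g. "qdata",
# "scale") may have been saved to different safetensors shard files.

def unflatten_tensor_state_dict(state_dict, metadata):
    """Rebuild subclass entries from flattened per-attribute keys.

    state_dict maps keys like "layer.weight:qdata" to plain tensors;
    metadata maps each subclass tensor name to its attribute names.
    """
    result = {}
    for name, attr_names in metadata.items():
        flat_keys = [f"{name}:{attr}" for attr in attr_names]
        # With sharded checkpoints, "qdata" and "scale" may arrive in
        # different files; only reassemble once every attribute is present.
        if not all(key in state_dict for key in flat_keys):
            continue
        result[name] = {attr: state_dict[f"{name}:{attr}"] for attr in attr_names}
        # Delete the consumed attributes from the original state dict so
        # later shards do not process the same entries again.
        for key in flat_keys:
            del state_dict[key]
    return result
```

With this shape, loading a shard that only contains `w:qdata` returns nothing and leaves the entry in place; once a later shard supplies `w:scale`, the entry is reassembled and both flattened keys are removed from the state dict, matching the deletion behavior the PR describes.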