[Core][Distributed] support both cpu and device tensor in broadcast tensor dict #4660
Prior to this PR, `broadcast_tensor_dict` only worked for CUDA tensors. This PR enables `broadcast_tensor_dict` to handle both CUDA and CPU tensors. This is useful when some metadata lives in CPU tensors, e.g. `blocks_to_swap_in` and `blocks_to_swap_out`, to be introduced in #4659.

Note: `blocks_to_copy` is still a CUDA tensor, because both the source and target of the copy live on the GPU and we have a dedicated copy kernel for it. `blocks_to_swap_in` and `blocks_to_swap_out` have to be CPU tensors, because they are kernel launch arguments.