Simplify broadcast logic for control messages #2501

zhuohan123 · 2024-01-19T00:27:05Z

In this PR, we simplify and unify the previous nasty control message broadcast logic.

njhill · 2024-01-19T03:27:07Z

@zhuohan123 looks much cleaner! But since most of the tensors share a dtype and have the same shape apart from the size of one dimension, I was wondering whether those could be concatenated, passing just the offsets, so that the number of broadcasts is minimized. This wouldn't be quite as nice and generic as what you've done but might not look too bad?

WoosukKwon

@zhuohan123 Thanks for cleaning this up! It looks much nicer to me. Please take a look at my minor comments.

WoosukKwon · 2024-01-19T05:39:23Z

vllm/model_executor/parallel_utils/communication_op.py

+class TensorMetadata:
+    """A simple class to hold tensor metadata."""
+
+    def __init__(self, tensor):


Suggested change

def __init__(self, tensor):

def __init__(self, tensor: torch.Tensor):

WoosukKwon · 2024-01-19T05:40:32Z

vllm/model_executor/parallel_utils/communication_op.py

+        self.size = tensor.size()
+
+    def __repr__(self):
+        return (f"TensorMetadata(dtype={self.dtype}, size={self.size})")


Suggested change

return (f"TensorMetadata(dtype={self.dtype}, size={self.size})")

return f"TensorMetadata(dtype={self.dtype}, size={self.size})"

WoosukKwon · 2024-01-19T05:41:59Z

vllm/model_executor/parallel_utils/communication_op.py

+def broadcast_tensor_dict(tensor_dict: Dict[Any, Union[torch.Tensor,
+                                                       Any]] = None,


The tensor_dict type is a bit weird: shouldn't it be Optional[Dict]? Also, how is Union[torch.Tensor, Any] different from Any?

Thanks, I indeed missed an Optional. I use Union[torch.Tensor, Any] to emphasize explicitly that this function treats torch.Tensor and other types differently.
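
As a rough illustration of the point about treating tensors and other values differently, here is a minimal sketch of how a dict with an `Optional` annotation might be split into metadata and tensor payloads. It is not the code from this PR: `split_tensor_dict` and `FakeTensor` are hypothetical names, and `FakeTensor` is only a stand-in for `torch.Tensor` so the sketch runs without torch.

```python
from dataclasses import dataclass
from typing import Any, Dict, List, Optional, Tuple, Union


@dataclass
class FakeTensor:
    """Hypothetical stand-in for torch.Tensor (assumption, not the real type)."""
    dtype: str
    size: Tuple[int, ...]


@dataclass
class TensorMetadata:
    """Describes a tensor (dtype and size) without carrying its data."""
    dtype: str
    size: Tuple[int, ...]


def split_tensor_dict(
    tensor_dict: Optional[Dict[Any, Union[FakeTensor, Any]]] = None,
) -> Tuple[List[Tuple[Any, Any]], List[FakeTensor]]:
    """Separate tensor values from plain Python values.

    Returns (metadata, tensors). metadata keeps every key in insertion order,
    with a TensorMetadata placeholder wherever the value was a tensor;
    tensors holds the tensor values themselves, in the same order.
    """
    metadata: List[Tuple[Any, Any]] = []
    tensors: List[FakeTensor] = []
    # Treat a missing dict (None) the same as an empty one.
    for key, value in (tensor_dict or {}).items():
        if isinstance(value, FakeTensor):
            metadata.append((key, TensorMetadata(value.dtype, value.size)))
            tensors.append(value)
        else:
            metadata.append((key, value))
    return metadata, tensors
```

In a design like this, the small metadata list could be sent as a pickled Python object while each tensor is sent separately, which is presumably why the two kinds of values are worth distinguishing in the annotation.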

WoosukKwon · 2024-01-19T05:43:37Z

vllm/model_executor/parallel_utils/communication_op.py

@@ -104,3 +109,67 @@ def broadcast_object_list(obj_list, src=0):
    # Broadcast.
    torch.distributed.broadcast_object_list(obj_list, src=src)
    return obj_list
+
+
+class TensorMetadata:


Can using a class instead of a raw data type incur any additional overhead? If so, can we use a named tuple instead?
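
For comparison, a named-tuple version of the metadata holder might look like the sketch below. This is only an illustration of the alternative raised in the question. The field types are stand-ins (a string for `torch.dtype`, a plain tuple for `torch.Size`) so that it runs without torch, and it makes no claim about which variant is actually cheaper.

```python
from typing import NamedTuple, Tuple


class TensorMetadata(NamedTuple):
    """Immutable record describing a tensor without holding its data."""
    dtype: str             # stand-in for torch.dtype in this sketch
    size: Tuple[int, ...]  # stand-in for torch.Size in this sketch


meta = TensorMetadata(dtype="float16", size=(4, 8))

# A named tuple supports attribute access and plain tuple unpacking,
# and gets a readable __repr__ without writing one by hand.
dtype, size = meta
```

One difference from the class in the diff is that a named tuple compares equal to an ordinary tuple with the same contents, which may or may not be desirable.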

zhuohan123 · 2024-01-19T06:45:01Z

@zhuohan123 looks much cleaner! But since most of the tensors share a dtype and have the same shape apart from the size of one dimension, I was wondering whether those could be concatenated, passing just the offsets, so that the number of broadcasts is minimized. This wouldn't be quite as nice and generic as what you've done but might not look too bad?

Yeah, but some of the tensors are int and some are float, and some tensors' shapes can also differ. In addition, concatenating and splitting tensors has its own overhead.
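
To make the trade-off concrete, the concatenate-and-offsets idea can be sketched as below. Plain Python lists stand in for same-dtype 1-D tensors, and `pack` and `unpack` are hypothetical helper names, not functions from this PR.

```python
from itertools import accumulate
from typing import List, Sequence, Tuple


def pack(chunks: Sequence[List[int]]) -> Tuple[List[int], List[int]]:
    """Concatenate chunks into one flat buffer and record where each one ends."""
    flat = [x for chunk in chunks for x in chunk]
    ends = list(accumulate(len(chunk) for chunk in chunks))
    return flat, ends


def unpack(flat: List[int], ends: List[int]) -> List[List[int]]:
    """Split the flat buffer back into the original chunks using the end offsets."""
    starts = [0] + ends[:-1]
    return [flat[start:end] for start, end in zip(starts, ends)]


chunks = [[1, 2, 3], [4], [5, 6]]
flat, ends = pack(chunks)      # one buffer to send instead of three
restored = unpack(flat, ends)  # receiver rebuilds the pieces
```

Even in this toy form, the sender makes an extra copy when packing and the receiver does extra work when splitting. The scheme also only applies to values that share an element type, which matches the objections raised in the reply above.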

zhuohan123 added 5 commits January 19, 2024 00:26

Simplify broadcast logic for control messages

9bffe64

add broadcast test

76b53d9

fix

26427db

format

b5e8492

format

46153b9

zhuohan123 requested review from Yard1 and WoosukKwon January 19, 2024 01:26

rename

053b221

WoosukKwon approved these changes Jan 19, 2024

zhuohan123 added 2 commits January 19, 2024 07:37

fix review comments

b024817

fix

665042b

zhuohan123 merged commit ef9b636 into main Jan 19, 2024
16 checks passed

GindaChen mentioned this pull request Jan 20, 2024

Add group as an argument in broadcast ops #2522

Merged

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024

Simplify broadcast logic for control messages (vllm-project#2501)

8d55050

zhuohan123 deleted the simplify-control-broadcast branch February 22, 2024 18:47
