
Conversation

@WoosukKwon
Collaborator

No description provided.

@WoosukKwon WoosukKwon requested review from LiuXiaoxuanPKU and zhuohan123 and removed request for LiuXiaoxuanPKU and zhuohan123 November 14, 2023 20:26
- vLLM’s mission is to build the fastest and easiest-to-use open-source LLM inference and serving engine. It is Apache 2.0 licensed and community-owned, with broad model and optimization support.

- vLLM matches DeepSpeed's speed in common scenarios and surpasses it when handling longer outputs.
- DeepSpeed only outperforms vLLM in scenarios with long prompts and short outputs, due to its Dynamic SplitFuse optimization. This optimization is on vLLM’s roadmap.
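For readers unfamiliar with the technique: Dynamic SplitFuse (and the chunked-prefill work on vLLM’s roadmap) caps each forward pass at a fixed token budget, splitting long prompt prefills into chunks and fusing them with in-flight decode tokens. The sketch below is only a minimal illustration of that batching idea, not DeepSpeed MII’s or vLLM’s actual scheduler; `TOKEN_BUDGET`, `Request`, and `build_batch` are hypothetical names.

```python
# Minimal sketch of the split-and-fuse batching idea (hypothetical names,
# not DeepSpeed MII's or vLLM's real scheduler).
from dataclasses import dataclass

TOKEN_BUDGET = 512  # max tokens processed per forward pass (assumed value)


@dataclass
class Request:
    request_id: str
    prompt_tokens_left: int   # prompt tokens not yet prefilled
    decoding: bool = False    # True once the request is generating output


def build_batch(requests: list[Request]) -> list[tuple[str, int]]:
    """Fill one iteration's batch: decode tokens first, then prompt chunks."""
    batch: list[tuple[str, int]] = []
    budget = TOKEN_BUDGET

    # Each decoding request contributes exactly one token this step.
    for req in requests:
        if req.decoding and budget > 0:
            batch.append((req.request_id, 1))
            budget -= 1

    # Remaining budget is spent on chunks of pending prefills, so a long
    # prompt is split across several iterations instead of stalling decodes.
    for req in requests:
        if not req.decoding and req.prompt_tokens_left > 0 and budget > 0:
            chunk = min(req.prompt_tokens_left, budget)
            batch.append((req.request_id, chunk))
            req.prompt_tokens_left -= chunk
            budget -= chunk
            if req.prompt_tokens_left == 0:
                req.decoding = True  # prefill done; start decoding next step

    return batch


if __name__ == "__main__":
    reqs = [Request("long-prompt", prompt_tokens_left=2000),
            Request("chatty", prompt_tokens_left=0, decoding=True)]
    print(build_batch(reqs))  # [('chatty', 1), ('long-prompt', 511)]
```

This is also why the benefit shows up mainly for long prompts with short outputs: that is the regime where a single monolithic prefill would otherwise dominate each iteration.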
Member

Suggested change
- DeepSpeed only outperforms vLLM in scenarios with long prompts and short outputs, due to its Dynamic SplitFuse optimization. This optimization is on vLLM’s roadmap.
- DeepSpeed only outperforms vLLM in scenarios with long prompts and short outputs with its Dynamic SplitFuse optimization. This optimization is on vLLM’s roadmap.

Collaborator Author

I think the current one is a bit better since it only has one "with"?

In our blog today, we'll elucidate the specific scenarios where the Dynamic SplitFuse technique is advantageous, noting that these cases are relatively limited.
For the majority of workloads, vLLM is faster than (or performs comparably to) DeepSpeed MII.

In this post, we will discuss the differences between the two systems, share our benchmarks, and outline future steps.
Member

Should we keep this?

Collaborator Author

I think this is redundant. In the previous sentence we already said "In this blog, ..."

WoosukKwon and others added 6 commits November 14, 2023 12:44
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
Member

@zhuohan123 zhuohan123 left a comment

LGTM! Thanks for the fix!

@WoosukKwon WoosukKwon merged commit 600dace into main Nov 14, 2023
@WoosukKwon WoosukKwon deleted the woosuk branch November 14, 2023 21:50
simon-mo pushed a commit that referenced this pull request Jan 27, 2025
Update 2025-01-12-intro-to-llama-stack-with-vllm.md