Changed scheduler to use deques instead of lists #2290

Merged
merged 4 commits into vllm-project:main on Jan 7, 2024

Conversation

@NadavShmayo (Contributor) commented on Dec 27, 2023

Currently the scheduler uses lists to store the running, waiting, and swapped requests.
When iterating over each state queue, we pop the first item and append it to a new list. This is bad for performance: each pop from the front of a list is O(N), and a pop happens for every item in the queue, so the scheduler currently runs in O(N^2) time.
Instead of using lists we could use deques, which let us pop the first item in O(1) time, bringing the scheduler down to O(N) overall.
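A minimal standalone sketch of the complexity difference (illustrative only, not the vLLM scheduler code):

```python
from collections import deque
import timeit

def drain_list(n: int) -> None:
    # list.pop(0) shifts every remaining element: O(N) per pop, O(N^2) overall.
    queue = list(range(n))
    while queue:
        queue.pop(0)

def drain_deque(n: int) -> None:
    # deque.popleft() removes the first element in O(1): O(N) overall.
    queue = deque(range(n))
    while queue:
        queue.popleft()

# The gap widens quickly as n grows.
print("list: ", timeit.timeit(lambda: drain_list(10_000), number=10))
print("deque:", timeit.timeit(lambda: drain_deque(10_000), number=10))
```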

I wasn't sure whether to keep the same types in the SchedulerOutputs class and cast the deques back to lists before returning the result, but since everything seems to work with SchedulerOutputs containing deques instead of lists, I decided to change the types.
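A hedged sketch of the two options, with illustrative field names rather than the real SchedulerOutputs definition:

```python
from collections import deque
from dataclasses import dataclass
from typing import Deque, List

@dataclass
class SchedulerOutputsWithLists:
    # Option A: keep list-typed fields and copy the deque before returning.
    scheduled_seq_groups: List[str]

@dataclass
class SchedulerOutputsWithDeques:
    # Option B (what this PR does): let the fields carry deques directly.
    scheduled_seq_groups: Deque[str]

running: Deque[str] = deque(["req-0", "req-1"])
a = SchedulerOutputsWithLists(scheduled_seq_groups=list(running))  # extra O(N) copy
b = SchedulerOutputsWithDeques(scheduled_seq_groups=running)       # no copy needed
```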

@WoosukKwon self-requested a review on January 2, 2024 17:25
@WoosukKwon (Collaborator) left a comment:

Hi @NadavShmayo, thanks for submitting the PR! Yes, it seems we should use deque instead of list for the queues in our scheduler. I left some minor comments on the PR. Please take a look at them!

Resolved review threads (outdated): vllm/core/policy.py (1), vllm/core/scheduler.py (8)
@WoosukKwon (Collaborator) left a comment:

LGTM. I've made some minor changes to accelerate the merge. Thanks again for the PR!

@WoosukKwon merged commit 05921a9 into vllm-project:main on Jan 7, 2024
2 of 3 checks passed
@NadavShmayo (Contributor, Author) commented:

> LGTM. I've made some minor changes to accelerate the merge. Thanks again for the PR!

Hey, sorry for the delayed response.
I hadn't gotten to fixing the code yet and meant to do it tomorrow, but it's good to see you already did!

Regarding the changes to the new_seq_lens logic: I thought my approach was better, since it avoids creating a new list each time to calculate the total batched tokens, which might be slow. But these are minor changes anyway.
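A hedged, self-contained sketch of that trade-off; FakeSeqGroup and its num_tokens field are hypothetical stand-ins, not the actual vLLM SequenceGroup API:

```python
from dataclasses import dataclass
from typing import List

@dataclass
class FakeSeqGroup:
    # Hypothetical stand-in for a scheduled sequence group.
    num_tokens: int

def batched_tokens_rebuild(scheduled: List[FakeSeqGroup]) -> int:
    # Build a fresh list of lengths on every scheduling step, then sum it
    # (one extra allocation per step).
    new_seq_lens = [group.num_tokens for group in scheduled]
    return sum(new_seq_lens)

def batched_tokens_incremental(scheduled: List[FakeSeqGroup]) -> int:
    # Accumulate the total directly, skipping the intermediate list.
    total = 0
    for group in scheduled:
        total += group.num_tokens
    return total

groups = [FakeSeqGroup(7), FakeSeqGroup(16), FakeSeqGroup(3)]
assert batched_tokens_rebuild(groups) == batched_tokens_incremental(groups) == 26
```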

Thanks for the review! 😄

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Jan 18, 2024
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
njhill added a commit to njhill/vllm that referenced this pull request Jan 20, 2024
vllm-project#2290 changed the scheduler seq group lists to be deques for more efficient updates, but missed one place where the `running` deque gets converted back to a list.
hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>