
[DOC] Add additional comments for LLMEngine and AsyncLLMEngine #1011

Merged Jan 12, 2024 (13 commits)
Conversation

@litone01 (Contributor) commented Sep 11, 2023

For the following functions:

  • LLMEngine: add_request, abort_request, step and _init_cache
  • AsyncLLMEngine: generate and abort

Summary of changes:

  1. Added Sphinx documentation pages for LLMEngine and AsyncLLMEngine. To address questions related to memory usage, documentation for _init_cache from LLMEngine is also added.
  2. To complement the existing comments, added implementation details and examples for the functions where necessary.
  3. Added a diagram to explain how step() performs one decoding iteration.

TODO:

  • Fix the linting error
  • Update the documentation to reflect the changes introduced during the past week.

Discussion:

  1. Is there any suggestion for how the diagram for the step() function may be improved? For example, should it be illustrated in the context of an example prompt? Any advice on making it more informative for readers would be appreciated.

Partially addresses the documentation issues in #244.

Could you kindly review this PR? Thanks! @zhuohan123 @WoosukKwon cc @LiuXiaoxuanPKU

@litone01 (Contributor, Author)

Closing the PR for now to fix the TODOs.

@litone01 litone01 closed this Sep 11, 2023
@litone01 litone01 reopened this Sep 11, 2023
@zhuohan123 zhuohan123 added the "documentation" label Sep 12, 2023
@WoosukKwon WoosukKwon self-requested a review October 2, 2023 17:58
@WoosukKwon (Collaborator)

Hi @litone01, sorry for the very late review, and many thanks for the contribution. I tried the PR on my laptop and got this error:

WARNING: autodoc: failed to import class 'async_llm_engine.AsyncLLMEngine' from module 'vllm.engine'; the following exception was raised:
No module named 'torch'

The generated doc does not contain anything under LLMEngine and AsyncLLMEngine. Could you take a look at this?

@litone01 (Contributor, Author) commented Oct 3, 2023


Thanks @WoosukKwon !

Is it because you are not using the correct Python environment? Building the docs requires the Python dependencies used for compilation and development, e.g. the torch package, to be present in the environment. It works fine on my local setup.
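As a quick aid for checking this, the doc build's import-time dependencies can be probed before running Sphinx. A minimal sketch (the module list is an assumption for illustration):

```python
# Hypothetical pre-flight check for the doc-build environment:
# autodoc can only document vllm if its import-time dependencies
# resolve, so probe them before running `make html`.
import importlib.util

required = ("torch", "vllm")
# find_spec returns None (without importing) when a top-level module
# is absent from the environment.
status = {name: importlib.util.find_spec(name) is not None for name in required}
for name, ok in status.items():
    print(f"{name}: {'found' if ok else 'MISSING - activate the dev environment'}")
```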

Also, let me fix the merge conflicts. There seems to be a compilation issue (different from the one Woosuk showed) after rebasing onto the latest changes; let me try to resolve that.

@litone01 (Contributor, Author) commented Oct 3, 2023

Could I seek your advice on the following error during doc compilation that happens after my latest rebase?

WARNING: autodoc: failed to import class 'async_llm_engine.AsyncLLMEngine' from module 'vllm.engine'; the following exception was raised:

cannot import name 'cuda_utils' from partially initialized module 'vllm' (most likely due to a circular import) (/Users/jerry/projects/vllm/vllm/__init__.py)

I see three possible reasons:

  1. There is a circular import. However, a code search suggests cuda_utils is only used in utils.py, and we did not add anything to cuda_utils from utils.py. Could it be caused by how the extension is attached in setup.py?
  2. There is a name conflict with another Python module. However, renaming the module from vllm.cuda_utils to vllm.cuda_utils2 in setup.py and utils.py does not resolve the issue.
  3. Something is wrong with Sphinx autodoc. I need to investigate further.

Thank you!
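For what it's worth, guess (1) is easy to reproduce in isolation. The sketch below (all module names hypothetical) builds a tiny package whose `__init__.py` pulls in a submodule that imports a missing name back from the still-initializing package, which yields the same "partially initialized module" error shape:

```python
# Minimal reproduction of the "partially initialized module" error.
# "demo_pkg" and its contents are hypothetical stand-ins for vllm.
import os
import sys
import tempfile

pkg_dir = tempfile.mkdtemp()
pkg = os.path.join(pkg_dir, "demo_pkg")
os.makedirs(pkg)

# __init__.py imports a submodule that, in turn, imports a name back
# from the still-initializing package -- the same shape as vllm's
# __init__.py vs. the missing compiled vllm.cuda_utils extension.
with open(os.path.join(pkg, "__init__.py"), "w") as f:
    f.write("from demo_pkg import engine\n")
with open(os.path.join(pkg, "engine.py"), "w") as f:
    f.write("from demo_pkg import cuda_utils\n")  # does not exist yet

sys.path.insert(0, pkg_dir)
err = ""
try:
    import demo_pkg  # noqa: F401
except ImportError as e:
    err = str(e)
# Typically reports: cannot import name 'cuda_utils' from partially
# initialized module 'demo_pkg' (most likely due to a circular import)
print(err)
```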

@simon-mo (Collaborator)

I think this is because cuda_utils is a pre-built shared object: it needs a GPU device to build. A practical fix would be to introduce mocks for these modules so Sphinx can safely ignore them. Please take a look at how Ray solved the same challenge:

https://github.com/ray-project/ray/blob/57c7988ff50f3912aa59da6c90fe74d64b35c63c/doc/source/conf.py#L461-L527

@litone01 are you able to continue to work on this? Thank you!
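For reference, Sphinx's built-in `autodoc_mock_imports` option covers the common case without hand-rolled mocks: listed modules are replaced with mock objects at import time, so autodoc can import the package on a machine with no GPU or compiled extensions. A minimal sketch of what this could look like in `docs/source/conf.py` (the module names are illustrative assumptions, not the PR's actual change):

```python
# Hypothetical excerpt from docs/source/conf.py.
# autodoc_mock_imports is a standard sphinx.ext.autodoc option: each
# listed module is mocked during the doc build, so importing vllm does
# not require these heavy or GPU-only dependencies to be installed.
autodoc_mock_imports = [
    "torch",
    "vllm.cuda_utils",  # pre-built shared object that needs a GPU to build
]
```

This is lighter-weight than Ray's manual `mock.MagicMock` approach linked above, at the cost of less control over how the mocked attributes behave.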

@litone01 (Contributor, Author) commented Nov 30, 2023


Thanks for the suggestion! Sure, I will take a closer look as soon as I have the bandwidth. I may need slightly more time to update the documentation with the latest code again.

Thanks!

@simon-mo simon-mo merged commit 6549aef into vllm-project:main Jan 12, 2024
2 of 4 checks passed
5 participants