[Doc] Add doc for running vLLM on the cloud #426
Conversation
Thanks for your contribution! Left some comments.
docs/source/serving/run_on_cloud.rst (outdated diff)

python -u -m vllm.entrypoints.api_server \
    --model $MODEL_NAME \
    --tensor-parallel-size $SKYPILOT_NUM_GPUS_PER_NODE \
    --tokenizer hf-internal-testing/llama-tokenizer 2>&1 | tee api_server.log &
Make the tokenizer an env var as well?
Sounds good. Changed. Thanks!
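For context, a minimal sketch of what the revised run section might look like once the tokenizer is pulled into an environment variable (the variable name TOKENIZER is an assumption from this exchange, not confirmed by the diff):

# Sketch only: TOKENIZER is a hypothetical env var name defined in the task's envs section.
run: |
  python -u -m vllm.entrypoints.api_server \
    --model $MODEL_NAME \
    --tensor-parallel-size $SKYPILOT_NUM_GPUS_PER_NODE \
    --tokenizer $TOKENIZER 2>&1 | tee api_server.log &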
docs/source/serving/run_on_cloud.rst (outdated diff)

conda activate vllm
echo 'Starting vllm api server...'
python -u -m vllm.entrypoints.api_server \
    --model $MODEL_NAME \
Where do we specify $MODEL_NAME?
It is defined in the envs section above. :)
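For readers without the full diff: SkyPilot task YAMLs declare environment variables in a top-level envs section, which the run commands then reference. A sketch of what that section might look like here (both values are illustrative assumptions, not taken from the diff):

# Sketch of the envs section of the SkyPilot task YAML; values are assumptions.
envs:
  MODEL_NAME: decapoda-research/llama-13b-hf  # illustrative; the doc's actual default may differ
  TOKENIZER: hf-internal-testing/llama-tokenizer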
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
LGTM! Can you rename run_on_cloud.rst to run_on_sky.rst before we merge? @Michaelvll
@Michaelvll @zhuohan123 Sorry for the late response, but why don't we use a single GPU and a smaller model for the example?
@WoosukKwon That sounds good to me. Do you think a single A100 with LLaMA-13B would work?
This should be great!
Done. PTAL @WoosukKwon @zhuohan123
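For reference, switching the example to a single GPU presumably amounts to a resources change along these lines in the SkyPilot task YAML (a sketch; the accelerator spec follows SkyPilot's accelerators syntax, and A100:1 mirrors the suggestion above):

# Sketch: request one A100 for the single-GPU LLaMA-13B example.
resources:
  accelerators: A100:1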