
[Usage]: How to specify a GPU when I use vllm by 'python -m vllm.entrypoints.openai.api_server --served-model-name..' #5585

Closed
GoogleAlphaZero opened this issue Jun 17, 2024 · 2 comments
Labels
usage How to use vllm

Comments

@GoogleAlphaZero

Your current environment

The output of `python collect_env.py`

How would you like to use vllm

I want to run inference of a [specific model](put link here). I don't know how to integrate it with vllm.

GoogleAlphaZero added the usage How to use vllm label Jun 17, 2024
@DarkLight1337
Collaborator

You can use the CUDA_VISIBLE_DEVICES environment variable to limit the GPUs that can be used by vLLM.
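For example, a minimal sketch (the model name below is only a placeholder, substitute your own):

```bash
# Expose only GPU 1 to the vLLM server; multiple GPUs can be listed
# comma-separated, e.g. CUDA_VISIBLE_DEVICES=0,1.
CUDA_VISIBLE_DEVICES=1 python -m vllm.entrypoints.openai.api_server \
    --model meta-llama/Llama-2-7b-chat-hf \
    --served-model-name my-model
```

Note that the visible GPUs are renumbered from 0 inside the process, so with `CUDA_VISIBLE_DEVICES=1` vLLM will report that it is running on device 0.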

@GoogleAlphaZero
Author

GoogleAlphaZero commented Jun 17, 2024 via email

hmellor closed this as completed Jul 4, 2024