It is recommended to use a dedicated device for vLLM

Hello,

I'm following your's [GRPO multi-GPU](https://swift.readthedocs.io/en/latest/Instruction/GRPO.html) approach on SLURM environment. I do understand why we would like to use separate GPU card(s) for vLLM deployment, however I've got errors with `NPROC_PER_NODE`. I was told that its number should be equal to GPU's count which is incorrect with separate cards for vLLM idea. I can use max 4 GPUs per one node.

`AssertionError: Colocate mode requires device_count(4) == num_infer_workers(4). Please check if your device count matches NPROC_PER_NODE setting.`

Any idea's why?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

It is recommended to use a dedicated device for vLLM #3719

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

It is recommended to use a dedicated device for vLLM #3719

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions