[CI/Test] fix swap test for multi gpu #4689

youkaichao · 2024-05-08T17:19:07Z

When we run the test code with multiple GPUs, the slot mapping lives in cuda, which is cuda:0 by default. The kv cache can live in cuda:1, so the copy kernel will trigger RuntimeError: CUDA error: an illegal memory access was encountered.

This PR changes them to be device together.

LiuXiaoxuanPKU

LGTM, thanks for the fix!

fix swap test for multi gpu

9a29692

simon-mo requested a review from LiuXiaoxuanPKU May 8, 2024 18:56

LiuXiaoxuanPKU approved these changes May 8, 2024

View reviewed changes

WoosukKwon merged commit 230c4b3 into vllm-project:main May 8, 2024
24 of 25 checks passed

youkaichao deleted the fix_swap_test branch May 8, 2024 20:15

z103cb pushed a commit to z103cb/opendatahub_vllm that referenced this pull request May 9, 2024

[CI/Test] fix swap test for multi gpu (vllm-project#4689)

cc3a658

robertgshaw2-neuralmagic pushed a commit to neuralmagic/nm-vllm that referenced this pull request May 19, 2024

[CI/Test] fix swap test for multi gpu (vllm-project#4689)

1fe8d9c

dtrifiro pushed a commit to dtrifiro/vllm that referenced this pull request May 21, 2024

[CI/Test] fix swap test for multi gpu (vllm-project#4689)

a696be1

Temirulan pushed a commit to Temirulan/vllm-whisper that referenced this pull request Sep 6, 2024

[CI/Test] fix swap test for multi gpu (vllm-project#4689)

2a33244

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CI/Test] fix swap test for multi gpu #4689

[CI/Test] fix swap test for multi gpu #4689

youkaichao commented May 8, 2024

LiuXiaoxuanPKU left a comment

[CI/Test] fix swap test for multi gpu #4689

[CI/Test] fix swap test for multi gpu #4689

Conversation

youkaichao commented May 8, 2024

LiuXiaoxuanPKU left a comment

Choose a reason for hiding this comment