Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fixes assertion failure in prefix caching: the lora index mapping should respect prefix_len. #2688

Merged
merged 1 commit into from
Jan 31, 2024

Conversation

sighingnow
Copy link
Contributor

Address issue #2612.

…uld respect prefix_len

Signed-off-by: Tao He <sighingnow@gmail.com>
Copy link
Collaborator

@Yard1 Yard1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks!

@Yard1 Yard1 merged commit d69ff0c into vllm-project:main Jan 31, 2024
17 checks passed
NikolaBorisov pushed a commit to deepinfra/vllm that referenced this pull request Jan 31, 2024
…uld respect prefix_len (vllm-project#2688)

Signed-off-by: Tao He <sighingnow@gmail.com>
@sighingnow sighingnow deleted the ht/fixes-prefix-caching branch February 1, 2024 02:12
hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024
…uld respect prefix_len (vllm-project#2688)

Signed-off-by: Tao He <sighingnow@gmail.com>
alexm-neuralmagic pushed a commit to neuralmagic/nm-vllm that referenced this pull request Feb 13, 2024
…uld respect prefix_len (vllm-project#2688)

Signed-off-by: Tao He <sighingnow@gmail.com>
xjpang pushed a commit to xjpang/vllm that referenced this pull request Feb 20, 2024
…uld respect prefix_len (vllm-project#2688)

Signed-off-by: Tao He <sighingnow@gmail.com>
xjpang pushed a commit to xjpang/vllm that referenced this pull request Feb 22, 2024
…uld respect prefix_len (vllm-project#2688)

Signed-off-by: Tao He <sighingnow@gmail.com>
xjpang pushed a commit to xjpang/vllm that referenced this pull request Mar 4, 2024
…uld respect prefix_len (vllm-project#2688)

Signed-off-by: Tao He <sighingnow@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants