@zhuohan123
Member

No description provided.

@zhuohan123 zhuohan123 requested a review from WoosukKwon June 20, 2023 03:14
Collaborator

@WoosukKwon WoosukKwon left a comment

LGTM!

@zhuohan123 zhuohan123 merged commit fc72e39 into main Jun 20, 2023
@zhuohan123 zhuohan123 deleted the change-image-url branch June 20, 2023 03:15
hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024
jikunshang pushed a commit to jikunshang/vllm that referenced this pull request Aug 15, 2024
wuhuikx pushed a commit to wuhuikx/vllm that referenced this pull request Mar 27, 2025
### What this PR does / why we need it?
To adapt the vLLM framework's MiniCPM-2B model to Ascend hardware, two
modifications are required:
1. The qkv type conversion in the forward function is removed to support
   the rope operator.
2. The fused_moe.fused_moe module is replaced to prevent errors caused by
   Triton not being supported. This modification can be removed once
   Triton is supported.
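
The second modification (swapping out a Triton-dependent module when Triton is unavailable) could look roughly like the sketch below. This is only an illustration under stated assumptions, not the code from the referenced commit: the module name `example_fused_moe`, the helper names, and the placeholder fallback are all hypothetical.

```python
import importlib.util
import sys
import types


def triton_available() -> bool:
    """Return True if a `triton` package can be located on this system."""
    try:
        return importlib.util.find_spec("triton") is not None
    except (ImportError, ValueError):
        return False


def fallback_fused_moe(*args, **kwargs):
    """Hypothetical stand-in for a Triton-backed fused MoE kernel."""
    raise NotImplementedError("no non-Triton fused_moe implementation provided")


def install_fused_moe_fallback(module_name="example_fused_moe", has_triton=None):
    """Register a stub module under `module_name` when Triton is missing.

    `module_name` is a hypothetical name used for illustration only.
    Returns True if the stub was installed, False if Triton is present.
    """
    if has_triton is None:
        has_triton = triton_available()
    if has_triton:
        return False
    stub = types.ModuleType(module_name)
    stub.fused_moe = fallback_fused_moe
    # Later `import example_fused_moe` statements now resolve to the stub.
    sys.modules[module_name] = stub
    return True
```

Because the stub is only installed when Triton cannot be found, the replacement becomes a no-op once Triton is supported, which matches the note that the modification can later be removed.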

### Does this PR introduce _any_ user-facing change?
no
### How was this patch tested?


Signed-off-by: Wang Kunpeng <1289706727@qq.com>
iwooook pushed a commit to moreh-dev/vllm that referenced this pull request Nov 29, 2025
… for Qwen2.5-14B based models (vllm-project#164)

Signed-off-by: Salar Hosseini <skhorasgani@tenstorrent.com>
3 participants