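As the title says: will vLLM be supported? It would give faster inference. https://github.com/vllm-project/vllm/blob/main/vllm/model_executor/models/yi.py
I tried to adapt the model by following that file, but there is a mismatch between num_kv_heads and share_kv_heads_num, and my attempt to rewrite the weight loading raises an error.
The official procedure for adding a model is described here: https://docs.vllm.ai/en/latest/models/adding_model.html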
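For anyone hitting the same mismatch, here is a minimal sketch of how the two settings might be reconciled. The thread does not say what share_kv_heads_num means, so the sketch covers two guesses: it is either the number of query heads that share one KV head, or the number of KV heads itself. The function names and the `means_group_size` flag are hypothetical and are not part of vLLM or the YAYI code. To check which reading fits a checkpoint, compare the row count of its key projection weight with num_kv_heads * head_dim.

```python
def derive_num_kv_heads(num_attention_heads: int,
                        share_kv_heads_num: int,
                        *,
                        means_group_size: bool) -> int:
    """Map a hypothetical share_kv_heads_num onto num_kv_heads.

    means_group_size=True  -> assume the value is how many query heads
                              share a single KV head.
    means_group_size=False -> assume the value already is the KV head count.
    Both readings are assumptions; the source thread does not define the field.
    """
    if num_attention_heads <= 0 or share_kv_heads_num <= 0:
        raise ValueError("head counts must be positive")
    # Grouped-query attention needs the query heads to split evenly.
    if num_attention_heads % share_kv_heads_num != 0:
        raise ValueError("num_attention_heads must be divisible by share_kv_heads_num")
    if means_group_size:
        return num_attention_heads // share_kv_heads_num
    return share_kv_heads_num


def expected_kv_proj_rows(num_kv_heads: int, head_dim: int) -> int:
    """Rows expected in a key (or value) projection weight of shape
    [num_kv_heads * head_dim, hidden_size]."""
    return num_kv_heads * head_dim


if __name__ == "__main__":
    # Illustrative numbers only, not taken from any real config.
    for flag in (True, False):
        n_kv = derive_num_kv_heads(32, 8, means_group_size=flag)
        print(flag, n_kv, expected_kv_proj_rows(n_kv, 128))
```

If neither interpretation matches the checkpoint's weight shapes, the field likely has a different meaning and the model's own attention code is the place to look.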
Both YAYI and YAYI2 have been successfully adapted for vLLM acceleration. We will submit the code to the vLLM repository soon.
I have not found it among vLLM's pull requests yet. Looking forward to your submission.