Add support for aquila #663
Conversation
Signed-off-by: ftgreat <ftgreat@163.com>
Signed-off-by: shunxing1234 <xw747777271@gmail.com>
Do I need to provide any additional information for this PR to be merged?
@shunxing1234 Thank you for your great contribution! Can you make sure the PR passes the format checks, and give one example showing that this model produces correct output?
https://github.com/FlagAI-Open/FlagAI/tree/master/examples/Aquila
Last week I verified the greedy output of transformers and vLLM on BAAI/AquilaChat-7B; the two are identical. One thing to pay attention to: Aquila has several stop strings, such as '###', '', and '[UNK]'. I just opened a PR to FastChat; once it is merged, we can use openai_api.py with BAAI/AquilaChat-7B.
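For anyone repeating this check, here is a minimal sketch of how the two greedy outputs could be compared after cutting them at the stop strings. The helper names `truncate_at_stop` and `same_greedy_output` are hypothetical (they are not part of vLLM, transformers, or FastChat), and the empty entry in the stop list above is skipped because its intended value is not recoverable from this thread.

```python
from typing import Sequence

# Stop strings named in the comment above. The empty entry from the
# original list is intentionally left out (assumption: it was a token
# lost in rendering, so it is not guessed here).
AQUILA_STOP_STRS = ("###", "[UNK]")


def truncate_at_stop(text: str, stop_strs: Sequence[str] = AQUILA_STOP_STRS) -> str:
    """Cut `text` at the earliest occurrence of any non-empty stop string."""
    cut = len(text)
    for stop in stop_strs:
        if not stop:
            # An empty stop string would match at index 0; ignore it.
            continue
        idx = text.find(stop)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]


def same_greedy_output(a: str, b: str,
                       stop_strs: Sequence[str] = AQUILA_STOP_STRS) -> bool:
    """Compare two generations after truncating both at the stop strings."""
    return truncate_at_stop(a, stop_strs) == truncate_at_stop(b, stop_strs)
```

Here `a` and `b` would be the decoded texts from the two backends for the same prompt with greedy decoding; text generated after a stop string is ignored, since only the part before it is shown to the user.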
@gesanqiu Thanks for your contribution!
LGTM! Just tested out BAAI/AquilaChat-7B and it works great on a single A100. Thank you for your contribution!
@zhuohan123 PR #836 fixes one typo; please review it. Thanks.
* add aquila
  Signed-off-by: ftgreat <ftgreat@163.com>
* fix some bug
  Signed-off-by: shunxing1234 <xw747777271@gmail.com>
* delete pdb
  Signed-off-by: shunxing1234 <xw747777271@gmail.com>
* fix bugs
  Signed-off-by: shunxing1234 <xw747777271@gmail.com>
* fix bugs
  Signed-off-by: shunxing1234 <xw747777271@gmail.com>
* delete whitespace
  Signed-off-by: shunxing1234 <xw747777271@gmail.com>
* format
* fix order

---------

Signed-off-by: ftgreat <ftgreat@163.com>
Signed-off-by: shunxing1234 <xw747777271@gmail.com>
Co-authored-by: ftgreat <ftgreat@163.com>
…oject#663) Changes the sampler used by dummy sequences to greedy if any sequence is using it. Prevents sampler recompilations.
### What this PR does / why we need it?
vLLM Ascend side followup on: [Core] Remove prompt string from engine core data structures vllm-project@df6f3ce
### Does this PR introduce _any_ user-facing change?
No
### How was this patch tested?
CI passed
Signed-off-by: Yikun Jiang <yikunkero@gmail.com>