Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEATURE] Qwen-7B Request #231

Closed
152334H opened this issue Aug 6, 2023 · 2 comments
Closed

[FEATURE] Qwen-7B Request #231

152334H opened this issue Aug 6, 2023 · 2 comments
Labels
enhancement New feature or request

Comments

@152334H
Copy link

152334H commented Aug 6, 2023

Qwen-7B is a model that (allegedly) outperforms Llama-2-13B on important benchmarks like MMLU, HumanEval, GSM8K, etc. It has a chat model as well, and is available for commercial use for people with <100Mil monthly active users.

It is architecturally fairly similar to llama, but unfortunately has custom modelling code, and therefore does not immediately work with this repo.

@152334H 152334H added the enhancement New feature or request label Aug 6, 2023
@CheshireAI
Copy link

I'd also be thrilled with this. I don't know if it helps but they have it implemented in these projects:
https://github.com/cckuailong/SuperAdapters
https://github.com/yangjianxin1/Firefly
https://github.com/ssbuild/qwen_finetuning

@qwopqwop200
Copy link
Collaborator

#240

@152334H 152334H closed this as completed Aug 8, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants