[FEATURE] Qwen-7B Request #231

152334H · 2023-08-06T21:31:26Z

Qwen-7B is a model that (allegedly) outperforms Llama-2-13B on important benchmarks such as MMLU, HumanEval, and GSM8K. It also has a chat variant, and is available for commercial use by those with fewer than 100 million monthly active users.

It is architecturally fairly similar to Llama, but unfortunately ships custom modelling code, and therefore does not work with this repo out of the box.
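For anyone wanting to check whether a checkpoint falls into the "custom modelling code" category, here is a rough sketch. It assumes the model follows the usual Hugging Face Hub convention of declaring an `auto_map` entry in `config.json` when it ships its own Python classes. The helper name `uses_custom_code` and the two abbreviated configs are hypothetical and made up for illustration, not copied from the actual repos.

```python
import json


def uses_custom_code(config: dict) -> bool:
    """Return True if a config.json dict declares a non-empty `auto_map`.

    On the Hugging Face Hub, `auto_map` points the Auto* classes at Python
    files bundled with the checkpoint, i.e. custom modelling code.
    """
    return bool(config.get("auto_map"))


# Abbreviated, illustrative configs (assumed shapes, not the real files).
llama_like = json.loads(
    '{"model_type": "llama", "architectures": ["LlamaForCausalLM"]}'
)
qwen_like = json.loads(
    """{
      "model_type": "qwen",
      "architectures": ["QWenLMHeadModel"],
      "auto_map": {"AutoModelForCausalLM": "modeling_qwen.QWenLMHeadModel"}
    }"""
)

print(uses_custom_code(llama_like))  # no auto_map: built-in architecture
print(uses_custom_code(qwen_like))   # auto_map present: custom code
```

In plain `transformers`, a checkpoint like this is normally loaded by passing `trust_remote_code=True` to `AutoModelForCausalLM.from_pretrained`. Tooling that hardcodes the Llama classes would presumably need explicit support added instead.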

CheshireAI · 2023-08-07T00:44:52Z

qwopqwop200 · 2023-08-08T10:32:44Z

152334H added the enhancement New feature or request label Aug 6, 2023

152334H closed this as completed Aug 8, 2023
