Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Are you planning on making bigger models? #25

Closed
francqz31 opened this issue Aug 3, 2023 · 2 comments
Closed

Are you planning on making bigger models? #25

francqz31 opened this issue Aug 3, 2023 · 2 comments

Comments

@francqz31
Copy link

Are there any intensions on making 13B , 30B or 60B kind of models , or any kind of bigger open-source foundation models??

@JustinLin610
Copy link
Member

Stay tuned!

@jklj077
Copy link
Contributor

jklj077 commented Sep 26, 2023

We release Qwen-14B and Qwen-14B-Chat on ModelScope and Hugging Face, along with qwen.cpp and Qwen-Agent. Codes and checkpoints of Qwen-7B and Qwen-7B-Chat are also updated. PLEASE PULL THE LATEST VERSION!

  • Compared to Qwen-7B (original), Qwen-7B uses more training tokens, increasing from 2.2T tokens to 2.4T tokens, while the context length extends from 2048 to 8192. The Chinese knowledge and coding ability of Qwen-7B have been further improved.

@jklj077 jklj077 closed this as completed Sep 26, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants