Are you planning on making bigger models? #25

francqz31 · 2023-08-03T23:27:06Z

Are there any intensions on making 13B , 30B or 60B kind of models , or any kind of bigger open-source foundation models??

JustinLin610 · 2023-08-31T08:34:14Z

Stay tuned!

jklj077 · 2023-09-26T07:24:31Z

We release Qwen-14B and Qwen-14B-Chat on ModelScope and Hugging Face, along with qwen.cpp and Qwen-Agent. Codes and checkpoints of Qwen-7B and Qwen-7B-Chat are also updated. PLEASE PULL THE LATEST VERSION!

Compared to Qwen-7B (original), Qwen-7B uses more training tokens, increasing from 2.2T tokens to 2.4T tokens, while the context length extends from 2048 to 8192. The Chinese knowledge and coding ability of Qwen-7B have been further improved.

jklj077 closed this as completed Sep 26, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Are you planning on making bigger models? #25

Are you planning on making bigger models? #25

francqz31 commented Aug 3, 2023

JustinLin610 commented Aug 31, 2023

jklj077 commented Sep 26, 2023

Are you planning on making bigger models? #25

Are you planning on making bigger models? #25

Comments

francqz31 commented Aug 3, 2023

JustinLin610 commented Aug 31, 2023

jklj077 commented Sep 26, 2023