You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have tested it with Qwen-1.8B on RTX 2080, and the reasoning acceleration is about twice the time compared to the original (50 tok/s vs ~100 tok/s) which is fascinating. Considering the Owen series has the same architecture, I thought it should be working for Owen-14B.
Qwen-14B: https://github.com/QwenLM/Qwen
The text was updated successfully, but these errors were encountered: