CANN: Add MROPE and IMROPE support #17401
Backend CANN0: OK Verified through tests on the qwen2.5VL-7B, qwen3VL-8B, and qwen3VL-30B-A3B models.
    acl_theta_scale_tensor.get());

if (ext_factor != 0) {
    // Step1.2: prepare rope_yarn_ramp, if this part updated, should update theta_scale_tensor.
Maybe it would be much cheaper to compute this part on the CPU. In most cases it will be calculated only once and then cached.
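To make the suggestion concrete, here is a minimal host-side sketch of computing the YaRN ramp on the CPU and reusing it until its inputs change. The `YarnRampCache` struct and its fields are hypothetical names for illustration, not code from this PR, and the ramp formula is assumed to follow ggml's CPU reference (`rope_yarn_ramp`) as I understand it.

```cpp
#include <algorithm>
#include <cstdint>
#include <vector>

// Hypothetical host-side cache for the YaRN ramp; not the PR's actual code.
struct YarnRampCache {
    float   low     = 0.0f;   // lower correction dim (assumed input)
    float   high    = 0.0f;   // upper correction dim (assumed input)
    int64_t n_pairs = 0;      // number of rotated dimension pairs
    std::vector<float> ramp;  // one ramp value per dimension pair
    int recompute_count = 0;  // how many times the ramp was rebuilt

    // Returns the cached ramp, recomputing only when a parameter changes.
    const std::vector<float> & get(float new_low, float new_high, int64_t new_n_pairs) {
        const bool hit = !ramp.empty() && new_low == low &&
                         new_high == high && new_n_pairs == n_pairs;
        if (!hit) {
            low     = new_low;
            high    = new_high;
            n_pairs = new_n_pairs;
            ramp.resize(static_cast<size_t>(n_pairs));
            for (int64_t i = 0; i < n_pairs; ++i) {
                // Assumed formula: 1 - clamp((i - low) / max(0.001, high - low), 0, 1)
                const float y = (static_cast<float>(i) - low) / std::max(0.001f, high - low);
                ramp[static_cast<size_t>(i)] = 1.0f - std::min(1.0f, std::max(0.0f, y));
            }
            ++recompute_count;
        }
        return ramp;
    }
};
```

With a cache like this, the device would only need a fresh upload of the ramp when `recompute_count` increases, which should be rare if the RoPE parameters stay fixed across calls.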
Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.
Pull Request Overview
Copilot reviewed 3 out of 3 changed files in this pull request and generated 8 comments.
noemotiovon
left a comment
Wow! This is truly an impressive project—just understanding the code alone is really brain-intensive! The code looks fine, except that IS_VISION hasn’t been verified yet. I’d like to contribute to that part in the future as well. Finally, thank you so much for your contribution!
1. Optimize the caching logic of rope_cache_init.
2. Add support for mRoPE and i-mRoPE.

Note that on Ascend 910B devices, it is necessary to disable FA in CLIP and disable NZ-format conversion. These two issues are still under investigation.
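For readers unfamiliar with the two modes, the difference is roughly in how each rotated dimension pair is assigned to a position axis (time, height, width, extra). The sketch below is an illustration only: `PosAxis`, `mrope_axis`, and `imrope_axis` are hypothetical names, and the selection rules are assumed to follow ggml's CPU reference as I understand it (contiguous sections for mRoPE, interleaved by `sector % 3` for i-mRoPE). It is not the CANN implementation from this PR.

```cpp
#include <array>

// Hypothetical names for illustration; not identifiers from the PR.
enum class PosAxis { T, H, W, E };

// mRoPE (assumed): dimension pairs are split into contiguous sections.
PosAxis mrope_axis(int pair_idx, const std::array<int, 4> & sections) {
    const int total = sections[0] + sections[1] + sections[2] + sections[3];
    if (total <= 0) {
        return PosAxis::T;  // degenerate input: fall back to the first axis
    }
    const int sector = pair_idx % total;
    if (sector < sections[0]) return PosAxis::T;
    if (sector < sections[0] + sections[1]) return PosAxis::H;
    if (sector < sections[0] + sections[1] + sections[2]) return PosAxis::W;
    return PosAxis::E;
}

// i-mRoPE (assumed): the first three axes are interleaved pair by pair.
PosAxis imrope_axis(int pair_idx, const std::array<int, 4> & sections) {
    const int total = sections[0] + sections[1] + sections[2] + sections[3];
    if (total <= 0) {
        return PosAxis::T;  // degenerate input: fall back to the first axis
    }
    const int sector = pair_idx % total;
    if (sector % 3 == 0 && sector < 3 * sections[0]) return PosAxis::T;
    if (sector % 3 == 1 && sector < 3 * sections[1]) return PosAxis::H;
    if (sector % 3 == 2 && sector < 3 * sections[2]) return PosAxis::W;
    return PosAxis::E;
}
```

For example, with the made-up sections `{2, 1, 1, 0}`, the mRoPE rule maps pairs 0..3 to T, T, H, W, while the interleaved rule maps them to T, H, W, T.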
@ggerganov @slaren Good day! Could you please review this PR? This change makes VL models available on the CANN backend. Thanks.