CANN: Add MROPE and IMROPE support #17401

hipudding · 2025-11-20T09:21:51Z

Optimize the caching logic of rope_cache_init.
Add support for mRoPE and i-mRoPE.

Note that on Ascend 910B devices, it is necessary to disable FA
in CLIP and disable NZ-format conversion. These two issues are
still under investigation.

Make sure to read the contributing guidelines before submitting a PR

hipudding · 2025-11-21T07:10:26Z

Backend CANN0: OK
Backend 2/5: CANN1
Skipping
Backend 3/5: CANN2
Skipping
Backend 4/5: CANN3
Skipping
Backend 5/5: CPU
Skipping
5/5 backends passed
OK

Verified through tests on the qwen2.5VL-7B, qwen3VL-8B, and qwen3VL-30B-A3B models.

hipudding · 2025-11-21T07:20:22Z

ggml/src/ggml-cann/aclnn_ops.cpp

-                                acl_theta_scale_tensor.get());
-
-        if (ext_factor != 0) {
+    // Step1.2: prepare rope_yarn_ramp, if this part updated, should update theta_scale_tensor.


Maybe this part, use cpu is much chaper. It will cache and calculate only once in most of cases.

Copilot

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

Copilot

Pull Request Overview

Copilot reviewed 3 out of 3 changed files in this pull request and generated 8 comments.

ggml/src/ggml-cann/aclnn_ops.cpp

noemotiovon

Wow! This is truly an impressive project—just understanding the code alone is really brain-intensive! The code looks fine, except that IS_VISION hasn’t been verified yet. I’d like to contribute to that part in the future as well. Finally, thank you so much for your contribution!

ggml/src/ggml-cann/ggml-cann.cpp

ggml/src/ggml-cann/aclnn_ops.cpp

1. Optimize the caching logic of rope_cache_init. 2. Add support for mRoPE and i-mRoPE. Note that on Ascend 910B devices, it is necessary to disable FA in CLIP and disable NZ-format conversion. These two issues are still under investigation.

hipudding · 2025-11-26T01:19:20Z

@ggerganov @slaren Good day! Could you please review this PR? This change make vl models available on CANN backend. Thanks.

github-actions bot added testing Everything test related ggml changes relating to the ggml tensor library for machine learning Ascend NPU issues specific to Ascend NPUs labels Nov 20, 2025

hipudding force-pushed the vl branch from b2bc76c to 3a4f8fb Compare November 21, 2025 07:08

hipudding marked this pull request as ready for review November 21, 2025 07:10

hipudding removed the testing Everything test related label Nov 21, 2025

hipudding commented Nov 21, 2025

View reviewed changes

hipudding requested a review from Copilot November 21, 2025 07:26

loci-dev mentioned this pull request Nov 21, 2025

UPSTREAM PR #17401: CANN: Add MROPE and IMROPE support auroralabs-loci/llama.cpp#277

Open

Copilot AI reviewed Nov 21, 2025

View reviewed changes

Copilot started reviewing on behalf of hipudding November 21, 2025 07:43 View session

Copilot finished reviewing on behalf of hipudding November 21, 2025 07:44

hipudding requested a review from Copilot November 21, 2025 07:56

Copilot started reviewing on behalf of hipudding November 21, 2025 07:56 View session

Copilot finished reviewing on behalf of hipudding November 21, 2025 07:59

Copilot AI reviewed Nov 21, 2025

View reviewed changes

noemotiovon approved these changes Nov 21, 2025

View reviewed changes

ggml/src/ggml-cann/ggml-cann.cpp Show resolved Hide resolved

ggml/src/ggml-cann/ggml-cann.cpp Show resolved Hide resolved

ggml/src/ggml-cann/aclnn_ops.cpp Show resolved Hide resolved

ggml/src/ggml-cann/aclnn_ops.cpp Show resolved Hide resolved

hipudding force-pushed the vl branch from eca0f8e to a8aea69 Compare November 24, 2025 02:51

hipudding added 2 commits November 24, 2025 02:55

CANN: ROPE supports both MROPE and IMROPE.

9225edf

1. Optimize the caching logic of rope_cache_init. 2. Add support for mRoPE and i-mRoPE. Note that on Ascend 910B devices, it is necessary to disable FA in CLIP and disable NZ-format conversion. These two issues are still under investigation.

Resolve review comments

8dd5470

hipudding force-pushed the vl branch from a8aea69 to 8dd5470 Compare November 24, 2025 02:55

hipudding requested review from ggerganov and slaren November 24, 2025 03:40

ggerganov approved these changes Nov 26, 2025

View reviewed changes

hipudding merged commit eeb5605 into ggml-org:master Nov 26, 2025
77 checks passed

CANN: Add MROPE and IMROPE support #17401

CANN: Add MROPE and IMROPE support #17401

Uh oh!

Conversation

hipudding commented Nov 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hipudding commented Nov 21, 2025

Uh oh!

hipudding Nov 21, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

noemotiovon left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

hipudding commented Nov 26, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

hipudding commented Nov 20, 2025 •

edited

Loading