Remove gidx input from MatMulNBits graph surgery by rM-planet · Pull Request #2278 · microsoft/Olive

rM-planet · 2025-12-08T22:53:48Z

Describe your changes

Adding a graph surgery which will remove group index input from the MatMulNBit nodes, only if the group indexes are sorted.

##Motivation:
MLAS matmulnbits kernel expects g_idx to not be passed if the node was quantized with default column-wise grouping for block-wise quantization. If g_idx is passed, it runs the unpacked compute kernel i.e dequantizes everything to fp32 and triggers a floating point matmul which significantly degrades the runtime performance.

##Impact:
Significant performance improvement for phi4 14b model without accuracy drop.

rM-planet · 2025-12-09T21:37:47Z

@devang-ml @jambayk @gtonpe Requesting you to please review the change.

devang-ml · 2025-12-09T22:30:01Z

Could you please add a unit test? Thanks!

gtonpe · 2025-12-16T01:27:18Z

When are the pending Olive CI tests expected to complete?

rM-planet force-pushed the mlperf_llms branch 2 times, most recently from d63f16a to 60ee7a4 Compare December 9, 2025 21:36

rM-planet force-pushed the mlperf_llms branch from 60ee7a4 to 8bde637 Compare December 12, 2025 00:15

Remove gidx input from MatMulNBits graph surgery

cb57d80

rM-planet force-pushed the mlperf_llms branch from 8bde637 to cb57d80 Compare December 12, 2025 00:32

devang-ml approved these changes Dec 12, 2025

View reviewed changes

xiaoyu-work merged commit 7c5b3b8 into microsoft:main Dec 18, 2025
15 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove gidx input from MatMulNBits graph surgery#2278

Remove gidx input from MatMulNBits graph surgery#2278
xiaoyu-work merged 1 commit intomicrosoft:mainfrom
CodeLinaro:mlperf_llms

rM-planet commented Dec 8, 2025

Uh oh!

rM-planet commented Dec 9, 2025

Uh oh!

devang-ml commented Dec 9, 2025

Uh oh!

gtonpe commented Dec 16, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

rM-planet commented Dec 8, 2025

Describe your changes

Uh oh!

rM-planet commented Dec 9, 2025

Uh oh!

devang-ml commented Dec 9, 2025

Uh oh!

gtonpe commented Dec 16, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants