Skip to content

[Snippets][CPU] Moved weight repacking of Brgemm to CreatePrimitive stage#29672

Merged
v-Golubev merged 8 commits intoopenvinotoolkit:masterfrom
a-sidorova:feature/snippets/weight_repacking
Mar 31, 2025
Merged

[Snippets][CPU] Moved weight repacking of Brgemm to CreatePrimitive stage#29672
v-Golubev merged 8 commits intoopenvinotoolkit:masterfrom
a-sidorova:feature/snippets/weight_repacking

Conversation

@a-sidorova
Copy link
Contributor

@a-sidorova a-sidorova commented Mar 25, 2025

Details:

  • To reduce overheads in runtime, we moved repacking of second constant input of Brgemm from inference stage to model compilation stage.
  • Introduced helper utils::get_planar_layout

Tickets:

@a-sidorova a-sidorova added this to the 2025.2 milestone Mar 25, 2025
@github-actions github-actions bot added the category: CPU OpenVINO CPU plugin label Mar 25, 2025
@a-sidorova a-sidorova force-pushed the feature/snippets/weight_repacking branch 5 times, most recently from e1f3f08 to 425110b Compare March 25, 2025 08:20
@a-sidorova a-sidorova marked this pull request as ready for review March 25, 2025 08:20
@a-sidorova a-sidorova requested review from a team as code owners March 25, 2025 08:20
@a-sidorova a-sidorova force-pushed the feature/snippets/weight_repacking branch from 425110b to 82af9da Compare March 25, 2025 10:08
Comment on lines 23 to -22
auto m_rank_norm = ov::pass::pattern::optional<ov::snippets::op::RankNormalization>(m_param);
auto m_copy_b = ov::pass::pattern::wrap_type<BrgemmCopyB>({m_param});
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hm, it seems like there is some code style check issue: m_rank_norm was unused, but there were no indications
Anyway, thanks for the fix :)

Copy link
Contributor

@v-Golubev v-Golubev left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good job 👍

a-sidorova and others added 2 commits March 28, 2025 15:03
Co-authored-by: Vladislav Golubev <vladislav.golubev@intel.com>
Co-authored-by: Vladislav Golubev <vladislav.golubev@intel.com>
@v-Golubev v-Golubev enabled auto-merge March 31, 2025 07:29
@v-Golubev v-Golubev added this pull request to the merge queue Mar 31, 2025
Merged via the queue into openvinotoolkit:master with commit 05ffd7f Mar 31, 2025
188 checks passed
@v-Golubev v-Golubev deleted the feature/snippets/weight_repacking branch March 31, 2025 14:15
sanleo-wq pushed a commit to sanleo-wq/openvino that referenced this pull request Apr 1, 2025
…tage (openvinotoolkit#29672)

### Details:
- *To reduce overheads in runtime, we moved repacking of second constant
input of `Brgemm` from inference stage to model compilation stage.*
 - *Introduced helper `utils::get_planar_layout`*
 
### Tickets:
 - *CVS-164529*

---------

Co-authored-by: Vladislav Golubev <vladislav.golubev@intel.com>
timxu826 pushed a commit to timxu826/openvino that referenced this pull request Apr 7, 2025
…tage (openvinotoolkit#29672)

### Details:
- *To reduce overheads in runtime, we moved repacking of second constant
input of `Brgemm` from inference stage to model compilation stage.*
 - *Introduced helper `utils::get_planar_layout`*
 
### Tickets:
 - *CVS-164529*

---------

Co-authored-by: Vladislav Golubev <vladislav.golubev@intel.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

category: CPU OpenVINO CPU plugin

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants