Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CPU] FullyConnected acceleration with 4bit weights decompression #20607

Conversation

dmitry-gorokhov
Copy link
Contributor

@dmitry-gorokhov dmitry-gorokhov commented Oct 19, 2023

Details:

TODO:

  • Some corner cases fixes
  • AMX support (can be done in separate PR)

Dependencies:

Tickets:

@dmitry-gorokhov dmitry-gorokhov added the category: CPU OpenVINO CPU plugin label Oct 19, 2023
@dmitry-gorokhov dmitry-gorokhov added this to the 2023.2 milestone Oct 19, 2023
@dmitry-gorokhov dmitry-gorokhov self-assigned this Oct 19, 2023
@dmitry-gorokhov dmitry-gorokhov requested review from a team as code owners October 19, 2023 11:30
@github-actions github-actions bot added category: inference OpenVINO Runtime library - Inference category: MO Model Optimizer category: Core OpenVINO Core (aka ngraph) category: IE Tests OpenVINO Test: plugins and common category: Python API OpenVINO Python bindings category: transformations OpenVINO Runtime library - Transformations category: preprocessing Inference Engine Preprocessing library category: TEMPLATE OpenVINO Template plugin category: CPP API OpenVINO CPP API bindings category: PyTorch FE OpenVINO PyTorch Frontend labels Oct 19, 2023
@dmitry-gorokhov dmitry-gorokhov marked this pull request as draft October 19, 2023 11:30
@dmitry-gorokhov dmitry-gorokhov force-pushed the feature/int4_weights_decompression_master branch from d96cf54 to c8cc125 Compare October 19, 2023 11:54
@github-actions github-actions bot removed category: MO Model Optimizer category: Python API OpenVINO Python bindings category: TEMPLATE OpenVINO Template plugin labels Oct 19, 2023
@dmitry-gorokhov dmitry-gorokhov removed request for a team and suryasidd October 19, 2023 11:55
@dmitry-gorokhov dmitry-gorokhov force-pushed the feature/int4_weights_decompression_master branch from c8cc125 to 2464497 Compare October 20, 2023 09:24
@github-actions github-actions bot added the category: Core OpenVINO Core (aka ngraph) label Oct 20, 2023
@dmitry-gorokhov dmitry-gorokhov marked this pull request as ready for review October 20, 2023 09:27
@dmitry-gorokhov
Copy link
Contributor Author

@xuchen-intel @v-Golubev guys, could you please review the changes?

@dmitry-gorokhov dmitry-gorokhov force-pushed the feature/int4_weights_decompression_master branch from 2464497 to 0999fc5 Compare October 24, 2023 14:12
@dmitry-gorokhov dmitry-gorokhov requested a review from a team as a code owner October 24, 2023 14:12
@github-actions github-actions bot added category: LP transformations OpenVINO Low Precision transformations and removed category: Core OpenVINO Core (aka ngraph) labels Oct 24, 2023
Copy link
Contributor

@v-Golubev v-Golubev left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@dmitry-gorokhov dmitry-gorokhov force-pushed the feature/int4_weights_decompression_master branch from 0999fc5 to b8eaf9f Compare October 25, 2023 09:36
@github-actions github-actions bot removed category: transformations OpenVINO Runtime library - Transformations category: LP transformations OpenVINO Low Precision transformations labels Oct 25, 2023
@dmitry-gorokhov dmitry-gorokhov force-pushed the feature/int4_weights_decompression_master branch from b8eaf9f to c87a318 Compare October 25, 2023 12:08
@dmitry-gorokhov dmitry-gorokhov merged commit 63299ec into openvinotoolkit:master Oct 25, 2023
65 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
category: CPU OpenVINO CPU plugin category: IE Tests OpenVINO Test: plugins and common category: inference OpenVINO Runtime library - Inference Code Freeze
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants