New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[CPU] FullyConnected acceleration with 4bit weights decompression #20607
Merged
dmitry-gorokhov
merged 4 commits into
openvinotoolkit:master
from
dmitry-gorokhov:feature/int4_weights_decompression_master
Oct 25, 2023
Merged
[CPU] FullyConnected acceleration with 4bit weights decompression #20607
dmitry-gorokhov
merged 4 commits into
openvinotoolkit:master
from
dmitry-gorokhov:feature/int4_weights_decompression_master
Oct 25, 2023
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
github-actions
bot
added
category: inference
OpenVINO Runtime library - Inference
category: MO
Model Optimizer
category: Core
OpenVINO Core (aka ngraph)
category: IE Tests
OpenVINO Test: plugins and common
category: Python API
OpenVINO Python bindings
category: transformations
OpenVINO Runtime library - Transformations
category: preprocessing
Inference Engine Preprocessing library
category: TEMPLATE
OpenVINO Template plugin
category: CPP API
OpenVINO CPP API bindings
category: PyTorch FE
OpenVINO PyTorch Frontend
labels
Oct 19, 2023
dmitry-gorokhov
force-pushed
the
feature/int4_weights_decompression_master
branch
from
October 19, 2023 11:54
d96cf54
to
c8cc125
Compare
github-actions
bot
removed
category: MO
Model Optimizer
category: Python API
OpenVINO Python bindings
category: TEMPLATE
OpenVINO Template plugin
labels
Oct 19, 2023
dmitry-gorokhov
force-pushed
the
feature/int4_weights_decompression_master
branch
from
October 20, 2023 09:24
c8cc125
to
2464497
Compare
@xuchen-intel @v-Golubev guys, could you please review the changes? |
v-Golubev
reviewed
Oct 20, 2023
src/plugins/intel_cpu/src/transformations/transformation_pipeline.cpp
Outdated
Show resolved
Hide resolved
xuchen-intel
requested changes
Oct 24, 2023
...mations/src/transformations/common_optimizations/convert_u4_weights_zero_point_to_scalar.cpp
Outdated
Show resolved
Hide resolved
...mations/src/transformations/common_optimizations/convert_u4_weights_zero_point_to_scalar.cpp
Outdated
Show resolved
Hide resolved
...ommon/transformations/tests/common_optimizations/convert_u4_weights_zero_point_to_scalar.cpp
Outdated
Show resolved
Hide resolved
dmitry-gorokhov
force-pushed
the
feature/int4_weights_decompression_master
branch
from
October 24, 2023 14:12
2464497
to
0999fc5
Compare
github-actions
bot
added
category: LP transformations
OpenVINO Low Precision transformations
and removed
category: Core
OpenVINO Core (aka ngraph)
labels
Oct 24, 2023
v-Golubev
approved these changes
Oct 24, 2023
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
src/plugins/intel_cpu/src/transformations/transformation_pipeline.cpp
Outdated
Show resolved
Hide resolved
xuchen-intel
approved these changes
Oct 25, 2023
dmitry-gorokhov
force-pushed
the
feature/int4_weights_decompression_master
branch
from
October 25, 2023 09:36
0999fc5
to
b8eaf9f
Compare
github-actions
bot
removed
category: transformations
OpenVINO Runtime library - Transformations
category: LP transformations
OpenVINO Low Precision transformations
labels
Oct 25, 2023
dmitry-gorokhov
force-pushed
the
feature/int4_weights_decompression_master
branch
from
October 25, 2023 12:08
b8eaf9f
to
c87a318
Compare
alvoron
pushed a commit
to alvoron/openvino
that referenced
this pull request
Nov 6, 2023
allnes
pushed a commit
to allnes/openvino
that referenced
this pull request
Nov 23, 2023
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
category: CPU
OpenVINO CPU plugin
category: IE Tests
OpenVINO Test: plugins and common
category: inference
OpenVINO Runtime library - Inference
Code Freeze
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Details:
-- Supported cases:
--- nodes: FullyConnected
--- weights compression: i4/u4/nf4
--- isa: avx2, avx512
TODO:
Dependencies:
Tickets: