Update default NNCF configurations #824
@nikita-savelyevv, @eaidova, review, please.
@echarlaix, please do not merge until the tests for the NNCF configurations have been added (as discussed with @eaidova).
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Thanks!
"togethercomputer/RedPajama-INCITE-7B-Instruct": {"bits": 4, "sym": False, "group_size": 128},
"HuggingFaceH4/zephyr-7b-beta": {
    "bits": 4,
    "sym": True,
    "group_size": 128,
    "ratio": 0.8,
    "dataset": "wikitext2",
    "awq": True,
    "quant_method": "awq",
Suggested change:
- "quant_method": "awq",
+ "quant_method": OVQuantizationMethod.AWQ,
Here and below.
This requires moving the OVQuantizationMethod definition before _DEFAULT_4BIT_CONFIGS. Do you agree with that change?
Yes, sure
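The ordering concern in this thread can be illustrated with a minimal sketch. The names below (`QuantMethod`, `CONFIGS`, the model ID) are hypothetical stand-ins, not the actual definitions in the repository. The point is only that a module-level dictionary is evaluated at import time, so any enum it references has to be defined above it.

```python
from enum import Enum


# Hypothetical stand-in for the real enum; it must appear before the
# dictionary below, otherwise importing the module raises NameError.
class QuantMethod(str, Enum):
    DEFAULT = "default"
    AWQ = "awq"


# Hypothetical stand-in for the per-model config table. The dict literal
# is evaluated when the module is imported, so QuantMethod must exist by now.
CONFIGS = {
    "some-org/some-model": {"bits": 4, "quant_method": QuantMethod.AWQ},
}

# Because the enum mixes in str, members still compare equal to the
# plain string they replace.
print(CONFIGS["some-org/some-model"]["quant_method"] == "awq")
```

Mixing `str` into the enum is an assumption made for this sketch; it keeps comparisons against the old string literal working while call sites migrate to the enum member.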
@echarlaix, @nikita-savelyevv, review again, please.
tests/openvino/test_quantization.py
@@ -849,6 +849,14 @@ def test_config_from_dict(self, quantization_config: dict, config_type: type, wa
    if hasattr(ov_config.quantization_config, k):
        self.assertEqual(getattr(ov_config.quantization_config, k), v)

@parameterized.expand(_DEFAULT_4BIT_CONFIGS)
Could you also please include _DEFAULT_4BIT_CONFIG here?
I'll do that, but the test won't look as clean then, since _DEFAULT_4BIT_CONFIGS is a dictionary keyed by model_id and _DEFAULT_4BIT_CONFIG is not.
Would this not work?
@parameterized(_DEFAULT_4BIT_CONFIGS.update({"default": _DEFAULT_4BIT_CONFIG}))
Unfortunately, no. The .update method doesn't return any value.
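A short sketch of why the suggestion fails, and one possible alternative. `DEFAULT_CONFIG` and `PER_MODEL_CONFIGS` are hypothetical stand-ins for the real objects, and the merge shown is only one option, not necessarily what the PR ended up doing. `dict.update` mutates the dictionary in place and returns `None`, so passing its result to a decorator passes `None`. Building a new dictionary with unpacking yields a value that can be passed on and leaves the original untouched.

```python
# Hypothetical stand-ins for the default config and the per-model table.
DEFAULT_CONFIG = {"bits": 4, "sym": False, "group_size": 128}
PER_MODEL_CONFIGS = {
    "some-org/some-model": {"bits": 4, "sym": True, "group_size": 64},
}

# dict.update works in place and returns None, so this expression
# evaluates to None (a copy is used here to avoid mutating the table).
result = PER_MODEL_CONFIGS.copy().update({"default": DEFAULT_CONFIG})
print(result is None)

# Unpacking builds a new merged dict and leaves the original unchanged.
merged = {**PER_MODEL_CONFIGS, "default": DEFAULT_CONFIG}

# A list of (name, config) pairs is a convenient shape for test parameters.
cases = list(merged.items())
print([name for name, _ in cases])
```

On Python 3.9 and later, `PER_MODEL_CONFIGS | {"default": DEFAULT_CONFIG}` produces the same merged dictionary.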
@echarlaix, could you review this again, please?
@echarlaix, could you please help merge it?
This reverts commit 31f49a2.
* Add configs from 143530
* Fix wrong AWQ option
* Apply comment
* Add test
* Add missed configuration
* Apply comment
What does this PR do?