Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

INT4 and other low-precision conversion support status #64193

Open
AIWintermuteAI opened this issue Mar 21, 2024 · 3 comments
Open

INT4 and other low-precision conversion support status #64193

AIWintermuteAI opened this issue Mar 21, 2024 · 3 comments
Assignees
Labels
comp:lite TF Lite related issues ModelOptimizationToolkit TF Model Optimization Toolkit stat:awaiting tensorflower Status - Awaiting response from tensorflower TFLiteConverter For issues related to TFLite converter type:feature Feature requests

Comments

@AIWintermuteAI
Copy link

What is the current status of model conversion (PTQ specifically) with INT4 precision?

The question was raised before here #60125.
Also it looks like INT4 support is being added to various parts of tensorflow, as evidenced by #63870 and

.

However at the moment there seems to be no way to quantize model to INT4 (specifically the weights):

Can anyone who actively works on this in TF team shine the light on what is the current direction and where one needs to dig to add INT4 PTQ quantization?

@AIWintermuteAI AIWintermuteAI added the TFLiteConverter For issues related to TFLite converter label Mar 21, 2024
@LakshmiKalaKadali LakshmiKalaKadali added the comp:lite TF Lite related issues label Mar 27, 2024
@LakshmiKalaKadali
Copy link
Contributor

Hi @AIWintermuteAI,

Please share your .tflite file to reproduce the issue.

Thank You

@LakshmiKalaKadali LakshmiKalaKadali added the stat:awaiting response Status - Awaiting response from author label Mar 27, 2024
Copy link

github-actions bot commented Apr 4, 2024

This issue is stale because it has been open for 7 days with no activity. It will be closed if no further activity occurs. Thank you.

@github-actions github-actions bot added the stale This label marks the issue/pr stale - to be closed automatically if no activity label Apr 4, 2024
@pkgoogle pkgoogle added type:feature Feature requests ModelOptimizationToolkit TF Model Optimization Toolkit and removed stale This label marks the issue/pr stale - to be closed automatically if no activity labels Apr 8, 2024
@pkgoogle
Copy link

pkgoogle commented Apr 8, 2024

Hi @abattery, can you please take a look or comment on this? Thanks.

@google-ml-butler google-ml-butler bot removed the stat:awaiting response Status - Awaiting response from author label Apr 8, 2024
@pkgoogle pkgoogle added the stat:awaiting tensorflower Status - Awaiting response from tensorflower label Apr 8, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
comp:lite TF Lite related issues ModelOptimizationToolkit TF Model Optimization Toolkit stat:awaiting tensorflower Status - Awaiting response from tensorflower TFLiteConverter For issues related to TFLite converter type:feature Feature requests
Projects
None yet
Development

No branches or pull requests

5 participants