Skip to content

add int4 quantization code in python#17077

Merged
yufenglee merged 6 commits into
microsoft:mainfrom
chenfucn:cfu_i4
Aug 11, 2023
Merged

add int4 quantization code in python#17077
yufenglee merged 6 commits into
microsoft:mainfrom
chenfucn:cfu_i4

Conversation

@chenfucn
Copy link
Copy Markdown
Contributor

@chenfucn chenfucn commented Aug 9, 2023

Description

Adding int4 quantization code in python

Motivation and Context

Python quantization tool no-longer needs to invoke shell to call a native exe

Comment thread onnxruntime/python/tools/quantization/matmul_weight4_quantizer.py Fixed
Comment thread onnxruntime/test/python/quantization/test_op_matmulfpq4.py Fixed
Comment thread onnxruntime/test/python/quantization/test_op_matmulfpq4.py Fixed
Comment thread onnxruntime/test/python/quantization/test_op_matmulfpq4.py Fixed
Comment thread onnxruntime/python/tools/quantization/matmul_weight4_quantizer.py Fixed
Copy link
Copy Markdown
Member

@yufenglee yufenglee left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

:shipit:

@zjc664656505
Copy link
Copy Markdown

May I know how whether there is a tutorial for running the int4 qunatization with onnx?

@yufenglee yufenglee merged commit f2e1b91 into microsoft:main Aug 11, 2023
jchen351 pushed a commit that referenced this pull request Aug 12, 2023
### Description
Adding int4 quantization code in python


### Motivation and Context
Python quantization tool no-longer needs to invoke shell to call a
native exe
@chenfucn chenfucn deleted the cfu_i4 branch August 14, 2023 16:14
kleiti pushed a commit to kleiti/onnxruntime that referenced this pull request Mar 22, 2024
### Description
Adding int4 quantization code in python


### Motivation and Context
Python quantization tool no-longer needs to invoke shell to call a
native exe
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants