Conversation

ochougul commented Sep 4, 2024

Closed 91 so that the GPTQ PR can go up.

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>
…code

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>
ochougul merged commit 29dc049 into quic:awq+gptq Sep 4, 2024
ochougul added a commit that referenced this pull request Sep 4, 2024
* added preprocess layer before loading quantized awq weights

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* added onnx export

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* added ScaledActivation class

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* refactoring the code to right places and added one single test for now

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* cleaned code

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* added proper tests, added decorator for updating quantizers, cleaned code

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* fixed CLI

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* added auto file for decorator

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

---------

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>
ochougul added a commit that referenced this pull request Sep 10, 2024
ochougul added a commit that referenced this pull request Sep 13, 2024
ochougul added a commit that referenced this pull request Sep 13, 2024
* Awq feature (#100)

* added preprocess layer before loading quantized awq weights

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* added onnx export

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* added ScaledActivation class

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* refactoring the code to right places and added one single test for now

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* cleaned code

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* added proper tests, added decorator for updating quantizers, cleaned code

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* fixed CLI

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* added auto file for decorator

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

---------

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* bugfix for tests

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* fixed tests for AWQ model

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* Adding support for GPTQ models (#103)

* Adding support for gptq models

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* Code cleaning and formatting

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* ruff format and fixed some bug

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* Added tests for gptq

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* Bug-fix-1

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* fixed bugs-2

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* fixed bug-3

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* Added docstring

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* Addressed comments

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* Addressed comments

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* fixed bugs-3

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* ruff check and format

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* Addressed comments-3

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

---------

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>
Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* added license at top for missing file

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* added export_and_compile and fixed bugs

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* removed GPTQ test

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* removed threading from pytest

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

---------

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>
Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>
Co-authored-by: Amit Raj <168538872+quic-amitraj@users.noreply.github.com>
quic-amitraj added a commit to quic-amitraj/efficient-transformers that referenced this pull request Sep 16, 2024
* Awq feature (quic#100)

* added preprocess layer before loading quantized awq weights

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* added onnx export

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* added ScaledActivation class

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* refactoring the code to right places and added one single test for now

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* cleaned code

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* added proper tests, added decorator for updating quantizers, cleaned code

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* fixed CLI

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* added auto file for decorator

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

---------

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* bugfix for tests

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* fixed tests for AWQ model

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* Adding support for GPTQ models (quic#103)

* Adding support for gptq models

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* Code cleaning and formatting

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* ruff format and fixed some bug

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* Added tests for gptq

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* Bug-fix-1

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* fixed bugs-2

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* fixed bug-3

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* Added docstring

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* Addressed comments

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* Addressed comments

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* fixed bugs-3

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* ruff check and format

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

* Addressed comments-3

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>

---------

Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>
Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* added license at top for missing file

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* added export_and_compile and fixed bugs

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* removed GPTQ test

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

* removed threading from pytest

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>

---------

Signed-off-by: Onkar Chougule <quic_ochougul@quicinc.com>
Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>
Co-authored-by: Amit Raj <168538872+quic-amitraj@users.noreply.github.com>
Signed-off-by: Amit Raj <quic_amitraj@quicinc.com>