Add QDQBert model and QAT example of SQUAD task #14057
Conversation
enable save_onnx for QA trainer
Hi, this PR includes both support for the QDQBert model and a QAT example that uses QDQBert for the SQuAD task.
Thanks for your PR. Note that it's hard to review because it includes changes from other commits on master (bad rebase?), so it would be better if you could re-open a clean PR from your branch.
Concerning the examples:
- I don't think the QAT example should go in the examples maintained by the team, given it introduces a lot of new code that no one on the team wrote or will be able to maintain properly. It should go in a research project.
- The classic QA example should not be touched by this PR. In general, any new functionality should be added to all examples at the same time, which could be done in a separate PR. It's also my understanding that the ONNX conversion won't work for many of the models, but maybe I'm wrong on this.
Thanks for the comments! I'm opening up a new PR here: #14066
What does this PR do?
This PR includes:
(src/transformers/models/qdqbert/)
The QDQBERT model adds fake quantization operations (pairs of QuantizeLinear/DequantizeLinear ops) to (i) linear layer inputs and weights, (ii) matmul inputs, and (iii) residual add inputs in the BERT model.
The QDQBERT model can load from any checkpoint of an HF BERT model and perform Quantization Aware Training / Post Training Quantization with support from the PyTorch-Quantization toolkit.
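For illustration, here is a minimal sketch (not code from this PR) of the fake quantization pattern the model relies on, using the PyTorch-Quantization toolkit; the fixed `amax` values are a stand-in for real calibration:

```python
import torch
import torch.nn.functional as F
from pytorch_quantization.nn import TensorQuantizer
from pytorch_quantization.tensor_quant import QuantDescriptor

# Each TensorQuantizer performs a QuantizeLinear/DequantizeLinear pair in one
# step, so the model trains against INT8 rounding error while staying in
# floating point. The amax values are illustrative; calibration would set them.
input_quantizer = TensorQuantizer(QuantDescriptor(num_bits=8, amax=4.0))
weight_quantizer = TensorQuantizer(QuantDescriptor(num_bits=8, amax=1.0))

x = torch.randn(2, 768)    # hidden states entering a linear layer
w = torch.randn(768, 768)  # the layer's weight

# Fake-quantize both operands before the matmul, mirroring what QDQBERT
# does for linear inputs/weights, matmul inputs, and residual adds.
y = F.linear(input_quantizer(x), weight_quantizer(w))
```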
(examples/pytorch/question-answering/QAT-qdqbert/)
In the example, we use the QDQBERT model to do Quantization Aware Training from a pretrained HF BERT model on the SQuAD task. TensorRT can then run inference on the generated ONNX model for optimal INT8 performance out-of-the-box.
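A hedged sketch of that flow, assuming the QDQBert classes introduced here plus the PyTorch-Quantization toolkit; the checkpoint name, output file name, and the fixed-`amax` loop (a stand-in for actual calibration/QAT on SQuAD) are illustrative only:

```python
import torch
from pytorch_quantization.nn import TensorQuantizer
from transformers import AutoTokenizer, QDQBertForQuestionAnswering

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
# QDQBERT mirrors BERT's architecture, so any HF BERT checkpoint loads directly.
model = QDQBertForQuestionAnswering.from_pretrained("bert-base-uncased").eval()
model.config.return_dict = False  # tuple outputs export more cleanly to ONNX

# Stand-in for calibration/QAT: give every quantizer a fixed range here.
# The example scripts calibrate and fine-tune on SQuAD instead.
for module in model.modules():
    if isinstance(module, TensorQuantizer):
        module.amax = 4.0

# Emit explicit QuantizeLinear/DequantizeLinear pairs in the exported graph;
# TensorRT consumes these to build an INT8 engine out-of-the-box.
TensorQuantizer.use_fb_fake_quant = True

enc = tokenizer("Who developed BERT?", "BERT was developed at Google.",
                return_tensors="pt")
torch.onnx.export(
    model,
    (enc["input_ids"], enc["attention_mask"], enc["token_type_ids"]),
    "qdqbert_squad.onnx",  # hypothetical file name
    input_names=["input_ids", "attention_mask", "token_type_ids"],
    output_names=["start_logits", "end_logits"],
    opset_version=13,
)
```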
Also added a module in (examples/pytorch/question-answering/run_qa.py, trainer_qa.py) for saving the SQuAD task-specific BERT model as ONNX files, as a consistency check against the QAT-qdqbert example.
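One possible shape for that consistency check, comparing logits from the FP32 baseline export and the QAT export with onnxruntime; both file names are hypothetical (the QAT one carried over from the sketch above):

```python
import numpy as np
import onnxruntime as ort
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
enc = tokenizer("Who developed BERT?", "BERT was developed at Google.",
                return_tensors="np")
feeds = {k: enc[k] for k in ("input_ids", "attention_mask", "token_type_ids")}

# Run the same features through both exported graphs.
fp32 = ort.InferenceSession("bert_squad_fp32.onnx",  # baseline from run_qa.py
                            providers=["CPUExecutionProvider"])
int8 = ort.InferenceSession("qdqbert_squad.onnx",    # QAT-qdqbert export
                            providers=["CPUExecutionProvider"])
fp32_start, fp32_end = fp32.run(None, feeds)
int8_start, int8_end = int8.run(None, feeds)

# After QAT the logits should stay close to the FP32 baseline.
print("max |start_logits diff|:", np.abs(fp32_start - int8_start).max())
print("max |end_logits diff|:", np.abs(fp32_end - int8_end).max())
```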
Before submitting
A related discussion on this topic: Issue #10639
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.