bert_module: fix inputs of export model #3815

virajkarandikar · 2022-03-09T11:27:57Z

Exported ONNX model had to be passed with "attention_mask" and
"token_type_ids" inputs swapped to get correct output.

Changing the input order fixes this issue.
Also return a dict instead of list for correctly passing inputs.

What does this PR do ?

Fix swapped model inputs for exported ONNX/TRT model.

Collection: NLP

Changelog

Fix order of inputs returned via input_types()

Usage

You can potentially add a usage example below

# Add a code snippet demonstrating how to use this

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests? - Updated related tests.
Did you add or update any necessary documentation? - Not required
Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc) - No
- Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

Bugfix

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

@MaximumEntropy, @ericharper, @ekmb, @yzhang123

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

Related to # (issue)

Exported ONNX model had to be passed with "attention_mask" and "token_type_ids" inputs swapped to get correct output. Changing the input order fixes this issue. Also return a dict instead of list for correctly passing inputs. Update relevant tests: test_TokenClassificationModel_export_to_onnx test_PunctuationCapitalizationModel_export_to_onnx test_QAModel_export_to_onnx Signed-off-by: Viraj Karandikar <vkarandikar@nvidia.com>

ryanleary · 2022-03-10T15:12:36Z

Changes look OK to me.

yzhang123 · 2022-03-11T16:50:19Z

@borisfom please double check before we merge

Exported ONNX model had to be passed with "attention_mask" and "token_type_ids" inputs swapped to get correct output. Changing the input order fixes this issue. Also return a dict instead of list for correctly passing inputs. Update relevant tests: test_TokenClassificationModel_export_to_onnx test_PunctuationCapitalizationModel_export_to_onnx test_QAModel_export_to_onnx Signed-off-by: Viraj Karandikar <vkarandikar@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com>

…nto account (#3826) * initial fix: taking data parallel size into consideration Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * update the signature Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * data_parallel_rank -> data_parallel_size Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * fix typo Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * cosmetic changes Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * Revert "update the signature" This reverts commit 1c134e5. Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * update batch_sampler arguments Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * change how to slice `batch` Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * update not drop_last path Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * add fr asr ckpt to doc (#3809) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * bert_module: fix inputs of exported model (#3815) Exported ONNX model had to be passed with "attention_mask" and "token_type_ids" inputs swapped to get correct output. Changing the input order fixes this issue. Also return a dict instead of list for correctly passing inputs. Update relevant tests: test_TokenClassificationModel_export_to_onnx test_PunctuationCapitalizationModel_export_to_onnx test_QAModel_export_to_onnx Signed-off-by: Viraj Karandikar <vkarandikar@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> * update random batch sampler Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Viraj Karandikar <16838694+jarivk@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com>

Exported ONNX model had to be passed with "attention_mask" and "token_type_ids" inputs swapped to get correct output. Changing the input order fixes this issue. Also return a dict instead of list for correctly passing inputs. Update relevant tests: test_TokenClassificationModel_export_to_onnx test_PunctuationCapitalizationModel_export_to_onnx test_QAModel_export_to_onnx Signed-off-by: Viraj Karandikar <vkarandikar@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com>

…nto account (#3826) * initial fix: taking data parallel size into consideration Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * update the signature Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * data_parallel_rank -> data_parallel_size Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * fix typo Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * cosmetic changes Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * Revert "update the signature" This reverts commit 1c134e5. Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * update batch_sampler arguments Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * change how to slice `batch` Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * update not drop_last path Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * add fr asr ckpt to doc (#3809) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * bert_module: fix inputs of exported model (#3815) Exported ONNX model had to be passed with "attention_mask" and "token_type_ids" inputs swapped to get correct output. Changing the input order fixes this issue. Also return a dict instead of list for correctly passing inputs. Update relevant tests: test_TokenClassificationModel_export_to_onnx test_PunctuationCapitalizationModel_export_to_onnx test_QAModel_export_to_onnx Signed-off-by: Viraj Karandikar <vkarandikar@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> * update random batch sampler Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Viraj Karandikar <16838694+jarivk@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com>

Exported ONNX model had to be passed with "attention_mask" and "token_type_ids" inputs swapped to get correct output. Changing the input order fixes this issue. Also return a dict instead of list for correctly passing inputs. Update relevant tests: test_TokenClassificationModel_export_to_onnx test_PunctuationCapitalizationModel_export_to_onnx test_QAModel_export_to_onnx Signed-off-by: Viraj Karandikar <vkarandikar@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com>

…nto account (#3826) * initial fix: taking data parallel size into consideration Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * update the signature Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * data_parallel_rank -> data_parallel_size Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * fix typo Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * cosmetic changes Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * Revert "update the signature" This reverts commit 1c134e5. Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * update batch_sampler arguments Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * change how to slice `batch` Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * update not drop_last path Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> * add fr asr ckpt to doc (#3809) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * bert_module: fix inputs of exported model (#3815) Exported ONNX model had to be passed with "attention_mask" and "token_type_ids" inputs swapped to get correct output. Changing the input order fixes this issue. Also return a dict instead of list for correctly passing inputs. Update relevant tests: test_TokenClassificationModel_export_to_onnx test_PunctuationCapitalizationModel_export_to_onnx test_QAModel_export_to_onnx Signed-off-by: Viraj Karandikar <vkarandikar@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> * update random batch sampler Signed-off-by: Masaki Kozuki <mkozuki@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Viraj Karandikar <16838694+jarivk@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com>

virajkarandikar force-pushed the vkarandikar/fix_bert_model_inputs branch from eaca787 to 2cd9f4f Compare March 9, 2022 11:34

virajkarandikar changed the title ~~Draft: bert_module: fix model inputs when exported~~ bert_module: fix model inputs when exported Mar 9, 2022

virajkarandikar force-pushed the vkarandikar/fix_bert_model_inputs branch 2 times, most recently from 2bf0c12 to dcac1ab Compare March 9, 2022 18:44

virajkarandikar marked this pull request as ready for review March 9, 2022 18:45

virajkarandikar force-pushed the vkarandikar/fix_bert_model_inputs branch from dcac1ab to 8d275c7 Compare March 9, 2022 18:47

virajkarandikar changed the title ~~bert_module: fix model inputs when exported~~ bert_module: fix inputs of export model Mar 9, 2022

virajkarandikar force-pushed the vkarandikar/fix_bert_model_inputs branch from 8d275c7 to 079e148 Compare March 9, 2022 18:48

Merge branch 'main' into vkarandikar/fix_bert_model_inputs

7da2e04

yzhang123 approved these changes Mar 11, 2022

View reviewed changes

borisfom approved these changes Mar 11, 2022

View reviewed changes

borisfom merged commit 3dd8a5c into NVIDIA:main Mar 11, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bert_module: fix inputs of export model #3815

bert_module: fix inputs of export model #3815

virajkarandikar commented Mar 9, 2022 •

edited

Loading

ryanleary commented Mar 10, 2022

yzhang123 commented Mar 11, 2022

bert_module: fix inputs of export model #3815

bert_module: fix inputs of export model #3815

Conversation

virajkarandikar commented Mar 9, 2022 • edited Loading

What does this PR do ?

Changelog

Usage

Before your PR is "Ready for review"

Who can review?

Additional Information

ryanleary commented Mar 10, 2022

yzhang123 commented Mar 11, 2022

virajkarandikar commented Mar 9, 2022 •

edited

Loading