
Script Fairseq transformer #1620

Closed · wants to merge 2 commits

Conversation

cndn (Contributor) commented Jan 14, 2020

Summary:
Make Fairseq transformer scriptable. Discussion points on code quality:

(1) The original decoder output is a tuple (x, {"attn": attn, "inner_states": inner_states}). TorchScript does not support dictionaries whose values have different types (attn: Tensor vs. inner_states: List[Tensor]). The current workaround is to wrap the attention tensor in a list, [attn], and access it downstream via output["attn"][0]; this is already the pattern used in the fairspeq custom transformer code. A (maybe) cleaner alternative is a named tuple for the decoder output, but that would require extensive downstream changes too.
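
A minimal sketch of workaround (1), assuming a toy decoder module (the class, shapes, and "attention" computation here are illustrative, not fairseq's actual API): the extra-output dictionary is typed Dict[str, List[Tensor]], so the lone attention tensor is wrapped in a one-element list to match the List[Tensor] type of "inner_states".

```python
from typing import Dict, List

import torch
from torch import Tensor


class ToyDecoder(torch.nn.Module):
    # TorchScript requires a single value type per dict, so the attention
    # tensor is wrapped in a one-element list to match List[Tensor].
    def forward(self, x: Tensor) -> Dict[str, List[Tensor]]:
        attn = x.softmax(dim=-1)  # stand-in for a real attention map
        inner_states: List[Tensor] = [x, attn]
        extra: Dict[str, List[Tensor]] = {
            "attn": [attn],  # wrapped: callers unwrap via out["attn"][0]
            "inner_states": inner_states,
        }
        return extra


scripted = torch.jit.script(ToyDecoder())
out = scripted(torch.randn(2, 3))
attn = out["attn"][0]  # unwrap the single-element list
```

Downstream code pays a small readability cost (the [0] index) in exchange for a dictionary type that TorchScript can compile.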

(2) TorchScript does not currently support **kwargs. Because of polymorphism, some arguments may be passed in that a given implementation never uses. The only workaround I can think of is to declare the possibly unused arguments explicitly (e.g. line 666 in transformer.py).
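
A sketch of workaround (2), with illustrative names (not fairseq's real signatures): instead of forward(self, x, **kwargs), every argument a caller might pass is declared explicitly with a default, even if this particular module ignores it.

```python
from typing import Optional

import torch
from torch import Tensor


class ToyEncoder(torch.nn.Module):
    # TorchScript cannot compile **kwargs, so arguments that only some
    # callers supply are declared explicitly with defaults; they may go
    # unused in a given subclass but keep the call sites uniform.
    def forward(
        self,
        src_tokens: Tensor,
        src_lengths: Optional[Tensor] = None,  # unused here; kept for API parity
        return_all_hiddens: bool = False,      # unused here; kept for API parity
    ) -> Tensor:
        return src_tokens * 2.0


scripted = torch.jit.script(ToyEncoder())
y = scripted(torch.ones(2))  # extra args can simply be omitted
```

The cost is some dead parameters in signatures; the benefit is that polymorphic call sites type-check under TorchScript.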

Differential Revision: D19234599

@facebook-github-bot (Contributor)

This pull request was exported from Phabricator. Differential Revision: D19234599


Differential Revision: D19382299

fbshipit-source-id: 56cbdc3462a19ff87d3c9b8b79878ed3b44806f9
Summary:
Pull Request resolved: fairinternal/fairseq-py#1011

Pull Request resolved: facebookresearch#1620


Differential Revision: D19234599

fbshipit-source-id: 64b8a64995bd2bf9a24f6b0665609a2856dad840

facebook-github-bot pushed a commit to pytorch/translate that referenced this pull request Jan 30, 2020
Reviewed By: myleott

Differential Revision: D19234599

fbshipit-source-id: db3dd364ecf3ae14fb7ac8c0928bd0ebe250f19d
@facebook-github-bot (Contributor)

This pull request has been merged in a07cb6f.

moussaKam pushed a commit to moussaKam/language-adaptive-pretraining that referenced this pull request Sep 29, 2020
yzpang pushed a commit to yzpang/gold-off-policy-text-gen-iclr21 that referenced this pull request Feb 19, 2021