Add type hints for PyTorch Models #16425

karthikrangasai · 2022-03-26T14:36:10Z

What does this PR do?

Add type hints to as many PyTorch models as possible.

This PR targets the following models to type hint entire files:

Albert
Bart
Bert
BertGeneration
BigBird
BigBirdPegasus
Canine
ConvBert
ConvNext
CTRL
Data2VecText
Data2VecAudio
Hubert
Marian
MBart
Nystromformer
Wav2Vec2
WavLM
XGLM
XLMRobertaXL
Yoso

Any other file that has been edited is a result of running make fix-copies.

In the next PR, I will target few other models to type hint complete files.

Fixes #16059

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@Rocketknight1

HuggingFaceDocBuilderDev · 2022-03-26T14:50:24Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

…to type hint 'config' accordingly for model _init__ method.

…copies'.

…r' failing tests because of Bart methods custom correction overwriting .

karthikrangasai · 2022-03-27T15:14:05Z

Hello all,

I changed the cokkiecutter template code because the check_copies script couldn't correct code that got changed to multiple lines.

Changing a method in Bart model:

##########
## From ##
##########
# Copied from transformers.models.bart.modeling_bart.BartDecoder._prepare_decoder_attention_mask
def _prepare_decoder_attention_mask(self, attention_mask, input_shape, inputs_embeds, past_key_values_length):

########
## To ##
########
# Copied from transformers.models.bart.modeling_bart.BartDecoder._prepare_decoder_attention_mask
def _prepare_decoder_attention_mask(
    self,
    attention_mask: torch.Tensor,
    input_shape: torch.Size,
    inputs_embeds: torch.FloatTensor,
    past_key_values_length: int,
) -> Optional[torch.Tensor]:

The code correcter script would make the following change:

# Copied from transformers.models.bart.modeling_bart.BartDecoder._prepare_decoder_attention_mask
def _prepare_decoder_attention_mask(self, attention_mask, input_shape, inputs_embeds, past_key_values_length):
    self,
    attention_mask: torch.Tensor,
    input_shape: torch.Size,
    inputs_embeds: torch.FloatTensor,
    past_key_values_length: int,
) -> Optional[torch.Tensor]:

Which caused the errors in one of the test runs.

Just commenting the reason here in case someone is taking a look at this PR later.

…copies'.

… that Data2Vec was dependent on, to use 'make fix-copies'.

karthikrangasai · 2022-03-28T21:17:50Z

Hello all,

I have updated the code base with type hints for a few models.
I will open a new PR for the remaining models after this one is merged, since this PR is getting bigger.

Thanks

cc: @Rocketknight1

karthikrangasai · 2022-03-28T21:36:38Z

Hello all,

The Add model like runner test is failing with an ImportError when starting to run the Run all PyTorch modeling test section of the tests with the following error:

ImportError: cannot import name 'get_current_traceback' from 'werkzeug.debug.tbtools' (/home/runner/.local/lib/python3.8/site-packages/werkzeug/debug/tbtools.py)

I am unsure as to what is causing the error and any leads on how to resolve this issue would be appreciated.

Thank you

Rocketknight1 · 2022-03-29T16:54:43Z

Wow, this is a huge PR! Did you do this manually, or have you figured out some kind of tool for it?

karthikrangasai · 2022-03-29T17:35:45Z

Hello @Rocketknight1 ,

Yeah, I made all this manually. This was how I spent my weekend 😛.

Rocketknight1 · 2022-03-30T12:35:39Z

That's amazing! I'll try to review now.

Rocketknight1 · 2022-03-30T14:16:21Z

This is a huge and very impressive PR, thank you! The main suggestion I have is that bools are not annotated in some cases, e.g. output_attentions=False should be output_attentions: bool = False, or output_attentions: Optional[bool] = None when the default is None. I'll try to recruit a couple of people from Huggingface to help me review the whole thing once that's resolved!

karthikrangasai · 2022-03-30T14:22:30Z

Hello,

Sure, I can also a take a look once again to fix the missing ones.
I had some doubts with a few of the types and I will post them here later to get the types for them and later update.

Thanks for the update and glad you liked the work.

Rocketknight1 · 2022-03-30T14:27:22Z

Absolutely! I saw in some cases past was missing annotations - if you're unsure about annotations like that, you can usually check the docstrings for the past or past_key_values argument for that model - it'll be something like Tuple[torch.Tensor]

karthikrangasai · 2022-03-30T15:00:55Z

Sure, later in the process I figured out the type and I had added for a few files. Fill fix for others as well.

Rocketknight1 · 2022-03-30T15:09:50Z

Note that past/past_key_values can have different structure in different models!

karthikrangasai · 2022-03-30T15:32:56Z

Ohhh, thanks for the heads up.

Tegzes · 2022-03-30T17:57:54Z

@karthikrangasai The best way to make sure the type hints are correct is to check the [Model Name]_INPUTS_DOCSTRING, right before the first user interfaced forward method

karthikrangasai · 2022-03-30T18:03:41Z

Hello @Tegzes ,
I checked that for all forward methods. But it might be possible that I missed it for a few files.

I have type hinted the entire file, from first function to last class. So i might have missed something in other places.

Rocketknight1 · 2022-04-01T14:44:30Z

Hi @karthikrangasai ! This is totally my bad - other PRs came in and I reviewed them without realizing they would create conflicts with your one. Would it be possible to break this PR up into a few separate ones and submit them one at a time? That greatly reduces the chances of conflicts for each one, and it'll make it possible for me to add specific comments/suggestions, whereas at this size I really can just give general advice!

karthikrangasai · 2022-04-01T15:02:00Z

Hello @Rocketknight1 ,

Yeah sure.
I would like to completely work on the typing issues if that's fine with you ( for all PyTorch Models - complete file).

I will break the PR into multiple ones based on the corrections made or the model that was type hinted.

Should I close this one then ?

Rocketknight1 · 2022-04-05T12:54:16Z

Hi @karthikrangasai, sorry for the delay! Yeah, it's probably easiest to close this one, make new ones and just tag me in them. Thank you!

Add type hints for PyTorch Models - Albert Model.

38f09c1

karthikrangasai added 10 commits March 26, 2022 20:49

Add type hints for PyTorch Models - BertGeneration Model.

2a079f6

Add type hints for PyTorch Models - BigBird Model. Fix # Copies from …

13a7956

…to type hint 'config' accordingly for model _init__ method.

Add small hints to Bart Model and Biet Model and run 'make fix-copies'.

ec64ab2

Add small type hints to Bert Model and run 'make fix-copies'.

9be9f24

Add type hints for PyTorch Models - BigBirdPegasus Model.

e6e12a1

Add type hints for PyTorch Models - ConvBert Model.

fa0a74c

Add type hints for PyTorch Models - ConvNext Model and run 'make fix-…

2b31160

…copies'.

Add type hints for PyTorch Models - Canine Model.

67371ac

Add type hints for PyTorch Models - CTRL Model.

b5c5827

Fix typing in cookiecutter template that fixed 'Model templates runne…

b5cbf0f

…r' failing tests because of Bart methods custom correction overwriting .

karthikrangasai mentioned this pull request Mar 28, 2022

Add missing type hints #16059

Closed

karthikrangasai added 8 commits March 28, 2022 21:35

Add type hints for PyTorch Models - Wav2Vec2 Model and run 'make fix-…

0367b55

…copies'.

Add type hints for PyTorch Models - Data2Vec Model. Had to fix models…

6a91aea

… that Data2Vec was dependent on, to use 'make fix-copies'.

Add type hints for PyTorch Models - XLMRobertaXL Model.

5a95260

Add type hints for PyTorch Models - Nystromformer Model.

922df18

Add type hints for PyTorch Models - Yoso Model.

0e5a478

Add type hints for PyTorch Models - WavLM Model.

1aa8b28

Add type hints for PyTorch Models - XGLM Model.

9a354b6

Merge branch 'main' into refactor/type_hints_for_models

a6f9dbc

karthikrangasai changed the title ~~[WIP] Add type hints for PyTorch Models~~ Add type hints for PyTorch Models Mar 28, 2022

Merge branch 'main' into refactor/type_hints_for_models

97fa5d8

karthikrangasai added 4 commits March 31, 2022 15:13

Merge branch 'main' into refactor/type_hints_for_models

178d1f9

Add missing types.

894a831

Fix cookiecutter template extra import issue.

10cc363

Merge branch 'main' into refactor/type_hints_for_models

5234b85

karthikrangasai closed this Apr 6, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add type hints for PyTorch Models #16425

Add type hints for PyTorch Models #16425

karthikrangasai commented Mar 26, 2022 •

edited

HuggingFaceDocBuilderDev commented Mar 26, 2022

karthikrangasai commented Mar 27, 2022

karthikrangasai commented Mar 28, 2022 •

edited

karthikrangasai commented Mar 28, 2022

Rocketknight1 commented Mar 29, 2022

karthikrangasai commented Mar 29, 2022

Rocketknight1 commented Mar 30, 2022

Rocketknight1 commented Mar 30, 2022

karthikrangasai commented Mar 30, 2022

Rocketknight1 commented Mar 30, 2022

karthikrangasai commented Mar 30, 2022

Rocketknight1 commented Mar 30, 2022

karthikrangasai commented Mar 30, 2022

Tegzes commented Mar 30, 2022

karthikrangasai commented Mar 30, 2022

Rocketknight1 commented Apr 1, 2022

karthikrangasai commented Apr 1, 2022 •

edited

Rocketknight1 commented Apr 5, 2022

Add type hints for PyTorch Models #16425

Add type hints for PyTorch Models #16425

Conversation

karthikrangasai commented Mar 26, 2022 • edited

What does this PR do?

Before submitting

Who can review?

HuggingFaceDocBuilderDev commented Mar 26, 2022

karthikrangasai commented Mar 27, 2022

karthikrangasai commented Mar 28, 2022 • edited

karthikrangasai commented Mar 28, 2022

Rocketknight1 commented Mar 29, 2022

karthikrangasai commented Mar 29, 2022

Rocketknight1 commented Mar 30, 2022

Rocketknight1 commented Mar 30, 2022

karthikrangasai commented Mar 30, 2022

Rocketknight1 commented Mar 30, 2022

karthikrangasai commented Mar 30, 2022

Rocketknight1 commented Mar 30, 2022

karthikrangasai commented Mar 30, 2022

Tegzes commented Mar 30, 2022

karthikrangasai commented Mar 30, 2022

Rocketknight1 commented Apr 1, 2022

karthikrangasai commented Apr 1, 2022 • edited

Rocketknight1 commented Apr 5, 2022

karthikrangasai commented Mar 26, 2022 •

edited

karthikrangasai commented Mar 28, 2022 •

edited

karthikrangasai commented Apr 1, 2022 •

edited