Empty assert hunt #6056

TevenLeScao · 2020-07-27T09:53:40Z

Empty asserts are bad for debugging. I tried to remove them all and to add helpful Pytorch-style messages with the shapes of the corresponding objects when they were mismatched-lengths checks + the file paths when they were file-not-found checks.

codecov · 2020-07-27T10:16:04Z

Codecov Report

Merging #6056 into master will decrease coverage by 0.35%.
The diff coverage is 22.91%.

@@            Coverage Diff             @@
##           master    #6056      +/-   ##
==========================================
- Coverage   78.82%   78.47%   -0.36%     
==========================================
  Files         146      146              
  Lines       26200    26204       +4     
==========================================
- Hits        20653    20564      -89     
- Misses       5547     5640      +93

Impacted Files	Coverage Δ
src/transformers/commands/train.py	`0.00% <ø> (ø)`
src/transformers/data/metrics/__init__.py	`26.66% <0.00%> (ø)`
src/transformers/data/metrics/squad_metrics.py	`0.00% <0.00%> (ø)`
src/transformers/data/processors/utils.py	`27.63% <0.00%> (ø)`
src/transformers/data/processors/xnli.py	`27.08% <0.00%> (-2.47%)`	⬇️
src/transformers/modeling_albert.py	`82.04% <0.00%> (ø)`
src/transformers/modeling_bert.py	`88.39% <0.00%> (ø)`
src/transformers/modeling_electra.py	`81.55% <0.00%> (ø)`
src/transformers/modeling_gpt2.py	`85.88% <0.00%> (ø)`
src/transformers/modeling_mobilebert.py	`89.45% <0.00%> (ø)`
... and 17 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 12f1471...09c7880. Read the comment docs.

mfuntowicz · 2020-07-27T10:50:32Z

I'm in love 😍 😍. Thanks @TevenLeScao !

sgugger

This is great! Thanks a lot for fixing those!
(just one nit in passing)

src/transformers/convert_marian_to_pytorch.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

src/transformers/commands/train.py

TevenLeScao · 2020-07-27T13:59:09Z

src/transformers/convert_marian_to_pytorch.py

        self.state_dict = dict(self.state_dict)
        self.wemb, self.final_bias = add_emb_entries(self.state_dict["Wemb"], self.state_dict[BIAS_KEY], 1)
        self.pad_token_id = self.wemb.shape[0] - 1
        cfg["vocab_size"] = self.pad_token_id + 1
        # self.state_dict['Wemb'].sha
        self.state_keys = list(self.state_dict.keys())
-        if "Wtype" in self.state_dict:
-            raise ValueError("found Wtype key")
+        assert "Wtype" not in self.state_dict, "Wtype key in state dictionary"


Are you sure ? it seems to me that the assert clause activates when the key is in the state dict, not the reverse, so that's what the error message should display

no you're correct. I don't see the value of the change but my suggestion is horrible.

Yeah I was just trying to harmonize things, not much of a change from raising an error to having an assert clause

src/transformers/convert_marian_to_pytorch.py

Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

TevenLeScao · 2020-07-27T13:59:09Z

src/transformers/convert_marian_to_pytorch.py

        self.state_dict = dict(self.state_dict)
        self.wemb, self.final_bias = add_emb_entries(self.state_dict["Wemb"], self.state_dict[BIAS_KEY], 1)
        self.pad_token_id = self.wemb.shape[0] - 1
        cfg["vocab_size"] = self.pad_token_id + 1
        # self.state_dict['Wemb'].sha
        self.state_keys = list(self.state_dict.keys())
-        if "Wtype" in self.state_dict:
-            raise ValueError("found Wtype key")
+        assert "Wtype" not in self.state_dict, "Wtype key in state dictionary"


Are you sure ? it seems to me that the assert clause activates when the key is in the state dict, not the reverse, so that's what the error message should display

TevenLeScao · 2020-08-03T08:11:07Z

@LysandreJik can I merge this?

LysandreJik

Yes, let's go! Thanks @TevenLeScao

@sshleifer

* Fixed empty asserts * black-reformatted stragglers in templates * More code quality checks * Update src/transformers/convert_marian_to_pytorch.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/convert_marian_to_pytorch.py Co-authored-by: Sam Shleifer <sshleifer@gmail.com> * removed unused line as per @sshleifer Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

Fixed empty asserts

f34fd2c

TevenLeScao requested review from sshleifer, thomwolf, LysandreJik and sgugger July 27, 2020 09:53

TevenLeScao added 2 commits July 27, 2020 11:59

black-reformatted stragglers in templates

5ef34c7

More code quality checks

4a92f36

sgugger approved these changes Jul 27, 2020

View reviewed changes

src/transformers/convert_marian_to_pytorch.py Outdated Show resolved Hide resolved

Update src/transformers/convert_marian_to_pytorch.py

e8db4e3

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

sshleifer approved these changes Jul 27, 2020

View reviewed changes

Update src/transformers/convert_marian_to_pytorch.py

b190b82

Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

TevenLeScao commented Jul 27, 2020

View reviewed changes

TevenLeScao added 2 commits July 27, 2020 16:05

removed unused line as per @sshleifer

91f87fd

Merge remote-tracking branch 'origin/empty_asserts' into empty_asserts

09c7880

LysandreJik approved these changes Aug 3, 2020

View reviewed changes

TevenLeScao merged commit 5a0dac5 into huggingface:master Aug 3, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Empty assert hunt #6056

Empty assert hunt #6056

TevenLeScao commented Jul 27, 2020 •

edited

Loading

codecov bot commented Jul 27, 2020 •

edited

Loading

mfuntowicz commented Jul 27, 2020

sgugger left a comment

TevenLeScao Jul 27, 2020

sshleifer Jul 27, 2020

TevenLeScao Jul 28, 2020

TevenLeScao Jul 27, 2020

TevenLeScao commented Aug 3, 2020

LysandreJik left a comment

Empty assert hunt #6056

Empty assert hunt #6056

Conversation

TevenLeScao commented Jul 27, 2020 • edited Loading

codecov bot commented Jul 27, 2020 • edited Loading

Codecov Report

mfuntowicz commented Jul 27, 2020

sgugger left a comment

Choose a reason for hiding this comment

TevenLeScao Jul 27, 2020

Choose a reason for hiding this comment

sshleifer Jul 27, 2020

Choose a reason for hiding this comment

TevenLeScao Jul 28, 2020

Choose a reason for hiding this comment

TevenLeScao Jul 27, 2020

Choose a reason for hiding this comment

TevenLeScao commented Aug 3, 2020

LysandreJik left a comment

Choose a reason for hiding this comment

TevenLeScao commented Jul 27, 2020 •

edited

Loading

codecov bot commented Jul 27, 2020 •

edited

Loading