
Fix ignore_mismatched_sizes #14085

Merged
sgugger merged 10 commits into huggingface:master from qqaatw:fix_ignore_mismatched_size_option on Oct 21, 2021

Conversation

qqaatw (Contributor) commented Oct 20, 2021

What does this PR do?

Fixes #14073

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@sgugger

sgugger (Collaborator) commented Oct 20, 2021

As you can see from the multiple failing tests, this makes the use case where we have a model with a task-specific head fail, so it looks like the fix is more complicated than just swapping the lines.

qqaatw (Contributor, Author) commented Oct 20, 2021

I believe these failures are due to the newly added AutoModel (TFAutoModel / FlaxAutoModel) test cases. I'm trying to find an appropriate AutoModel output that can be used to test mismatched sizes, but there seem to be some model-specific restrictions or assertions.

sgugger (Collaborator) commented Oct 20, 2021

Indeed, some of the models are failing because of internal math tying the hidden size to other dimensions, while others fail for different reasons. Maybe you should adapt your test to change just the `vocab_size`?
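Conceptually, `ignore_mismatched_sizes` drops checkpoint weights whose shape disagrees with the model's and reports them, instead of raising a size-mismatch error. A stand-alone sketch of that idea (plain tuples instead of tensors, and a hypothetical helper name, not the actual `modeling_utils` code):

```python
# Illustrative sketch of what ignore_mismatched_sizes conceptually does:
# weights whose checkpoint shape differs from the model's shape are removed
# from the state dict (and collected for reporting) rather than raising.
# Shapes are plain tuples here; the real code compares torch tensor shapes.

def filter_mismatched(checkpoint_shapes, model_shapes):
    kept = {}
    mismatched = []
    for key, shape in checkpoint_shapes.items():
        if key in model_shapes and model_shapes[key] != shape:
            # Record (key, checkpoint shape, model shape) and skip loading it.
            mismatched.append((key, shape, model_shapes[key]))
        else:
            kept[key] = shape
    return kept, mismatched

ckpt = {"embeddings.weight": (100, 32), "encoder.weight": (32, 32)}
model = {"embeddings.weight": (10, 32), "encoder.weight": (32, 32)}
kept, mismatched = filter_mismatched(ckpt, model)
assert list(kept) == ["encoder.weight"]
assert mismatched == [("embeddings.weight", (100, 32), (10, 32))]
```

Changing only the `vocab_size` produces exactly one such mismatch (the embedding matrix) without disturbing any dimension that other layers derive from, which is what makes it a convenient test target.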

qqaatw force-pushed the fix_ignore_mismatched_size_option branch from 44530c8 to f163c26 on October 21, 2021 at 10:21
sgugger (Collaborator) commented Oct 21, 2021

Thanks for fixing the tests! You should also skip the test for LayoutLMv2 (which expects a `bbox` input), and there is the encoder-decoder template test to fix; I left a suggestion for that.

qqaatw (Contributor, Author) commented Oct 21, 2021

Fixed, thanks for the suggestions.

```diff
@@ -1513,9 +1513,9 @@ def _load_state_dict_into_model(
 for checkpoint_key in loaded_keys:
     model_key = checkpoint_key
     if remove_prefix and checkpoint_key.startswith(prefix):
```
Contributor commented:

@sgugger - do you know why we need the `and checkpoint_key.startswith(prefix)` here? The way I understand it, if `remove_prefix` is `True` then it's never possible for any loaded key to start with the prefix, no?

`remove_prefix == True` => `has_prefix_module == False` => `checkpoint_key.startswith(prefix)` can never be `True`, no? What is the case that I'm missing here?
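The implication chain above can be checked with a stand-alone sketch (plain strings in place of real state-dict keys; `bert` is an illustrative prefix, and `remove_prefix` is assumed to be derived as `not has_prefix_module and expects_prefix_module`, as the comment implies):

```python
# Illustrative recreation of the prefix bookkeeping under discussion;
# not the actual transformers code.
prefix = "bert"
loaded_keys = ["embeddings.weight", "encoder.layer.0.weight"]  # checkpoint keys, no prefix
expected_keys = [f"{prefix}.{k}" for k in loaded_keys]         # model keys, with prefix

has_prefix_module = any(k.startswith(prefix) for k in loaded_keys)
expects_prefix_module = any(k.startswith(prefix) for k in expected_keys)
remove_prefix = not has_prefix_module and expects_prefix_module

# remove_prefix == True forces has_prefix_module == False, so the extra
# `and checkpoint_key.startswith(prefix)` guard can never hold for any loaded key:
assert remove_prefix
assert not any(k.startswith(prefix) for k in loaded_keys)
```

Which supports the observation that a branch guarded by the extra condition would be dead code whenever `remove_prefix` is `True`.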

Collaborator replied:

That's correct. The `and` should probably be removed.

```diff
-    elif add_prefix:
-        model_key = f"{prefix}.{checkpoint_key}"
+    elif add_prefix:
+        model_key = ".".join(checkpoint_key.split(".")[1:])
```
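Taken on their own, the two branch bodies map checkpoint keys to model keys in opposite directions. A stand-alone sketch (plain strings and a hypothetical helper name; following the conversation, `remove_prefix` here means the checkpoint lacks the prefix the model expects, and `add_prefix` means the checkpoint carries a prefix the model does not):

```python
prefix = "bert"  # illustrative base-model prefix

def to_model_key(checkpoint_key, remove_prefix, add_prefix):
    if remove_prefix:
        # Model keys carry the prefix but checkpoint keys don't,
        # so prepend it to look the weight up in the model.
        return f"{prefix}.{checkpoint_key}"
    elif add_prefix:
        # Checkpoint keys carry the prefix but model keys don't,
        # so strip the leading component.
        return ".".join(checkpoint_key.split(".")[1:])
    return checkpoint_key

assert to_model_key("embeddings.weight", True, False) == "bert.embeddings.weight"
assert to_model_key("bert.embeddings.weight", False, True) == "embeddings.weight"
```

The inverted naming (`remove_prefix` prepending, `add_prefix` stripping) is exactly what motivates the rename proposed later in this thread.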
Contributor commented:

This change looks correct to me!

patrickvonplaten approved these changes Oct 21, 2021

patrickvonplaten (Contributor) left a comment:

Change looks correct to me!

@sgugger - IMO it would make sense to rename `remove_prefix` to `remove_prefix_from_init_model` to make the code easier to understand. Do you agree? I can open a follow-up PR if that's the case.

qqaatw (Contributor, Author) commented Oct 21, 2021

I agree with @patrickvonplaten. Maybe we can rename `add_prefix` to `add_prefix_to_init_model` as well to make it clearer.

sgugger (Collaborator) commented Oct 21, 2021

I agree with @patrickvonplaten's suggestion. Let's merge this PR once the comment is addressed, and then we can do the renaming in a follow-up PR!

sgugger (Collaborator) commented Oct 21, 2021

You will need to include this commit in your PR branch, as the latest PyTorch release (1.10) broke our CI.

@sgugger sgugger merged commit 234cfef into huggingface:master Oct 21, 2021

Successfully merging this pull request may close these issues.

ignore_mismatched_sizes does not work properly
