
size mismatch for encoder.model.layers.1.downsample.norm.weight: copying a param with shape torch.Size([1024]) from checkpoint, the shape in current model is torch.Size([512]). #206

Open
Sridhar-Ranganaboina opened this issue Jun 6, 2023 · 6 comments


@Sridhar-Ranganaboina


I am getting this error at line 596 in model.py:
model = super(DonutModel, cls).from_pretrained(pretrained_model_name_or_path, revision="official", *model_args, **kwargs)

@jzsyuan

jzsyuan commented Jun 11, 2023

ignore_mismatched_sizes=True
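
For context, a minimal sketch of that workaround, assuming the model is loaded through DonutModel.from_pretrained as in model.py (the keyword argument is forwarded to transformers' from_pretrained; the checkpoint name below is only an example):

from donut import DonutModel

# Example checkpoint name; substitute the one you are actually loading.
# ignore_mismatched_sizes=True makes transformers skip the mismatched
# downsample.norm / downsample.reduction tensors instead of raising,
# leaving them randomly initialized.
model = DonutModel.from_pretrained(
    "naver-clova-ix/donut-base-finetuned-docvqa",
    ignore_mismatched_sizes=True,
)

Because the skipped weights are re-initialized rather than loaded from the checkpoint, this silences the error but can degrade accuracy; pinning the library versions (see the later comments) is the cleaner fix.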

@gwkrsrch
Collaborator

Hi, thank you for bringing this issue to our attention. The problem is likely related to the environment configuration. We will resolve the issue and update the repository accordingly. In the meantime, please refer to the recently updated Google Colab demos and verify that the versions of the essential libraries match. You can find the necessary information at this link: GitHub Issue Comment

@yaoliUoA

The Google Colab demo has the same size mismatch issue.

I have reproduced the error in "colab-demo-for-donut-base-finetuned-docvqa.ipynb" as well.

/usr/local/lib/python3.10/dist-packages/transformers/modeling_utils.py in _load_pretrained_model(cls, model, state_dict, loaded_keys, resolved_archive_file, pretrained_model_name_or_path, ignore_mismatched_sizes, sharded_metadata, _fast_init, low_cpu_mem_usage, device_map, offload_folder, offload_state_dict, dtype, is_quantized, keep_in_fp32_modules)
3530 "\n\tYou may consider adding ignore_mismatched_sizes=True in the model from_pretrained method."
3531 )
-> 3532 raise RuntimeError(f"Error(s) in loading state_dict for {model.__class__.__name__}:\n\t{error_msg}")
3533
3534 if is_quantized:

RuntimeError: Error(s) in loading state_dict for DonutModel:
size mismatch for encoder.model.layers.1.downsample.norm.weight: copying a param with shape torch.Size([1024]) from checkpoint, the shape in current model is torch.Size([512]).
size mismatch for encoder.model.layers.1.downsample.norm.bias: copying a param with shape torch.Size([1024]) from checkpoint, the shape in current model is torch.Size([512]).
size mismatch for encoder.model.layers.1.downsample.reduction.weight: copying a param with shape torch.Size([512, 1024]) from checkpoint, the shape in current model is torch.Size([256, 512]).
size mismatch for encoder.model.layers.2.downsample.norm.weight: copying a param with shape torch.Size([2048]) from checkpoint, the shape in current model is torch.Size([1024]).
size mismatch for encoder.model.layers.2.downsample.norm.bias: copying a param with shape torch.Size([2048]) from checkpoint, the shape in current model is torch.Size([1024]).
size mismatch for encoder.model.layers.2.downsample.reduction.weight: copying a param with shape torch.Size([1024, 2048]) from checkpoint, the shape in current model is torch.Size([512, 1024]).
You may consider adding ignore_mismatched_sizes=True in the model from_pretrained method.

@ariefwijaya

Any update on this?

@crackthedata

Any update on this? I have tried multiple versions of timm and transformers and am still getting the same error.

@david-fm

david-fm commented Dec 4, 2023

Be sure to have the proper versions:

!pip install transformers==4.25.1
!pip install pytorch-lightning==1.6.4
!pip install timm==0.5.4
!pip install gradio
!pip install donut-python

and compare the app code with the corresponding Google Colab notebook.
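
A quick sanity check that the environment actually matches those pins (a minimal sketch; the names are the PyPI distribution names used in the pip commands above):

import importlib.metadata as md

# Print the installed version of each pinned distribution.
for pkg in ("transformers", "pytorch-lightning", "timm", "donut-python"):
    print(pkg, md.version(pkg))

With transformers 4.25.1 and timm 0.5.4, the Swin encoder should be built with the layer dimensions the checkpoint expects, so the downsample size-mismatch error should not occur.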
