
[Fix] add ignore keys if classes differ - KIE training #1271

Merged
merged 1 commit into from
Jul 27, 2023

Conversation

felixdittrich92
Contributor

This PR:

  • makes it possible to use `--pretrained` with a pretrained detection model when training on a KIE dataset
  • changes nothing for "normal" detection training and inference
  • handles it the same way as for recognition models with differing vocabs

Any feedback is welcome!

@felixdittrich92 felixdittrich92 self-assigned this Jul 27, 2023
@felixdittrich92 felixdittrich92 added type: bug Something isn't working module: models Related to doctr.models framework: pytorch Related to PyTorch backend topic: text detection Related to the task of text detection labels Jul 27, 2023
@felixdittrich92 felixdittrich92 added this to the 0.7.0 milestone Jul 27, 2023
@codecov

codecov bot commented Jul 27, 2023

Codecov Report

Merging #1271 (d2ef624) into main (f113530) will increase coverage by 0.00%.
Report is 1 commit behind head on main.
The diff coverage is 100.00%.

@@           Coverage Diff           @@
##             main    #1271   +/-   ##
=======================================
  Coverage   95.75%   95.75%           
=======================================
  Files         154      154           
  Lines        6901     6903    +2     
=======================================
+ Hits         6608     6610    +2     
  Misses        293      293           
Flag      | Coverage Δ
unittests | 95.75% <100.00%> (+<0.01%) ⬆️

Flags with carried forward coverage won't be shown.

Files Changed                                         | Coverage Δ
...s/detection/differentiable_binarization/pytorch.py | 97.79% <100.00%> (+0.01%) ⬆️
doctr/models/detection/linknet/pytorch.py             | 98.19% <100.00%> (+0.01%) ⬆️

... and 1 file with indirect coverage changes

Comment on lines +316 to +317
# The number of class_names is not the same as the number of classes in the pretrained model =>
# remove the layer weights
Collaborator

Can you elaborate?

Contributor

Yes, just like with the recognition models: if we want to fine-tune a pretrained model on another vocab, we have to reset (or resize) the classifier head as well as the embeddings, otherwise it leads to a shape mismatch. We now have the same situation when training on a KIE dataset, since the number of classes varies; in normal detection training there is always exactly one class (text), so it didn't matter before.

Contributor

This resets the conv "head", which makes it possible to use `--pretrained` for KIE detection training as well, with no impact on normal detection training and inference. Without it, `--pretrained` raises a shape mismatch while initializing the model from the pretrained checkpoint.
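The mechanism can be sketched in plain Python (the key names, helper, and checkpoint contents below are illustrative, not doctr's actual internals): keys belonging to the classification head are dropped from the checkpoint's state dict whenever the class lists differ, so that layer is freshly initialized instead of loaded with mismatched shapes.

```python
def filter_state_dict(state_dict, ignore_keys):
    """Drop the listed keys so a head with a different shape can be re-initialized."""
    if not ignore_keys:
        return dict(state_dict)
    return {k: v for k, v in state_dict.items() if k not in ignore_keys}


# Hypothetical checkpoint trained on a single class ("text");
# the new KIE training run uses several classes instead.
checkpoint = {
    "backbone.weight": [1, 2, 3],
    "prob_head.weight": [0.5],  # shape depends on the number of classes
    "prob_head.bias": [0.1],
}

class_names = ["date", "total", "address"]  # KIE dataset classes
default_class_names = ["words"]             # classes of the pretrained model

# Mirror of the CRNN-style condition: only skip the head if the classes differ
ignore_keys = (
    ["prob_head.weight", "prob_head.bias"]
    if class_names != default_class_names
    else None
)

filtered = filter_state_dict(checkpoint, ignore_keys)
print(sorted(filtered))  # ['backbone.weight']
```

Everything except the head is still loaded from the checkpoint, which is why normal detection training and inference are unaffected: when the class lists match, `ignore_keys` is `None` and the full state dict is used.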

Contributor

As an example, for CRNN:

    if pretrained:
        # The number of classes is not the same as the number of classes in the pretrained model =>
        # remove the last layer weights
        _ignore_keys = ignore_keys if _cfg["vocab"] != default_cfgs[arch]["vocab"] else None
        load_pretrained_params(model, _cfg["url"], ignore_keys=_ignore_keys)

with the factory passing:

    ignore_keys=["linear.weight", "linear.bias"]

Collaborator

Super clear, thanks

@felixdittrich92 felixdittrich92 merged commit 416c639 into mindee:main Jul 27, 2023
58 checks passed
@felixdittrich92 felixdittrich92 deleted the fix-kie-pretrained branch July 27, 2023 15:31