[Fix] TensorFlow SAR_Resnet31 implementation #925

felixdittrich92 · 2022-05-20T13:25:01Z

This PR: (Still in progress :))

fix SARDecoder forward step
check train
check inference
cleanup / improve code

Any feedback is welcome 🤗

Issue:
#802

felixdittrich92 · 2022-05-23T08:37:17Z

@frgfm
training: Overall it looks good. With the same config as used for the PT implementation, it reaches an exact match of ~70% after 17 epochs.
inference: I have some problems translating the torch.scatter_ functionality. Could you help with this? :)
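
For context on the torch.scatter_ question above, the following is a minimal plain-Python sketch of what an in-place scatter along dimension 1 does on a 2-D tensor, and of the explicit (row, col) index pairs that an N-d scatter op such as tf.tensor_scatter_nd_update takes. It is not the code from this PR: the helper names scatter_rows and to_nd_indices, the batch size, and the vocabulary size are hypothetical, and it assumes the scatter is used to one-hot encode one class id per sample.

```python
from typing import List


def scatter_rows(dest: List[List[float]], index: List[List[int]], value: float) -> List[List[float]]:
    """Reference semantics of `dest.scatter_(1, index, value)` for a 2-D tensor.

    For every position (i, j) in `index`, writes `value` to dest[i][index[i][j]].
    Returns a new nested list instead of mutating `dest` in place.
    """
    out = [row[:] for row in dest]
    for i, cols in enumerate(index):
        for col in cols:
            out[i][col] = value
    return out


def to_nd_indices(index: List[List[int]]) -> List[List[int]]:
    """Expand per-row column indices into explicit [row, col] pairs.

    This is the index layout an N-d scatter update expects for a 2-D tensor.
    """
    return [[i, col] for i, cols in enumerate(index) for col in cols]


# Hypothetical example: batch of 3 samples, vocabulary of 5 symbols.
batch_size, vocab_size = 3, 5
previous_symbols = [[2], [0], [4]]  # one class id per sample, shape (3, 1)

zeros = [[0.0] * vocab_size for _ in range(batch_size)]
one_hot = scatter_rows(zeros, previous_symbols, 1.0)
nd_indices = to_nd_indices(previous_symbols)
```

When the destination is all zeros and the scattered value is 1, the result is a plain one-hot encoding, which tf.one_hot can produce directly.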

doctr/models/recognition/sar/tensorflow.py

codecov · 2022-05-23T10:40:56Z

Codecov Report

Merging #925 (19b287f) into main (0c8dd60) will increase coverage by 0.04%.
The diff coverage is 97.56%.

@@            Coverage Diff             @@
##             main     #925      +/-   ##
==========================================
+ Coverage   94.68%   94.72%   +0.04%     
==========================================
  Files         134      134              
  Lines        5491     5501      +10     
==========================================
+ Hits         5199     5211      +12     
+ Misses        292      290       -2

Flag	Coverage Δ
unittests	`94.72% <97.56%> (+0.04%)`	⬆️

Impacted Files	Coverage Δ
doctr/models/classification/resnet/tensorflow.py	`100.00% <ø> (ø)`
doctr/models/recognition/sar/tensorflow.py	`99.25% <97.56%> (+0.87%)`	⬆️
doctr/models/recognition/sar/pytorch.py	`98.50% <0.00%> (-0.02%)`	⬇️
doctr/transforms/functional/base.py	`97.10% <0.00%> (+1.44%)`	⬆️

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

frgfm · 2022-05-25T19:12:59Z

> @frgfm
> training: Overall it looks good. With the same config as used for the PT implementation, it reaches an exact match of ~70% after 17 epochs.
> inference: I have some problems translating the torch.scatter_ functionality. Could you help with this? :)

Could you check the test perf against FUNSD or CORD? :)
cf. bench https://mindee.github.io/doctr/using_doctr/using_models.html#id5

frgfm

Thanks for the PR Felix 🙏
I added a few comments!

doctr/models/recognition/sar/pytorch.py

doctr/models/recognition/sar/tensorflow.py

felixdittrich92 · 2022-05-25T21:05:12Z

> > @frgfm
> > training: Overall it looks good. With the same config as used for the PT implementation, it reaches an exact match of ~70% after 17 epochs.
> > inference: I have some problems translating the torch.scatter_ functionality. Could you help with this? :)
>
> Could you check the test perf against FUNSD or CORD? :)
> cf. bench https://mindee.github.io/doctr/using_doctr/using_models.html#id5

@frgfm I think this would not be comparable without a full dataset (like Mindee's internal one) :/ I am currently testing the implementations on a difficult toy dataset with 10k train and 2.5k val samples (it contains extremely blurred and rotated images, ~30 different fonts, and some corrupted characters). Maybe if I find a bit more time I could train it on MJSynth.

felixdittrich92 · 2022-05-25T22:37:10Z

@frgfm ok now with kwargs and check 👍 😅

frgfm · 2022-05-30T10:15:11Z

FYI: this PR introduces the deprecation of resnet31 as a pretrained model for TF

felixdittrich92 · 2022-05-30T10:47:05Z

> FYI: this PR introduces the deprecation of resnet31 as a pretrained model for TF

@frgfm
Yes, I know, but (correct me if I'm wrong) it is currently only used as the backbone for SAR (and MASTER, which will also change before the next release), so I would say let's include this in the next training iteration.

Trained 1 epoch on an MJSynth subset: 500k train / 200k val
Validation loss decreased inf --> 0.215485: saving state...
Epoch 1/100 - Validation loss: 0.215485 (Exact: 72.99% | Partial: 77.53%)

FUNSD:
Validation loss: 1.89441 (Exact: 44.94% | Partial: 47.41%)
CORD:
Validation loss: 2.34539 (Exact: 36.30% | Partial: 36.77%)

I would say that for a toy run this does not look bad at all, and we can close the SAR issue with this PR. WDYT? :)
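
As a rough illustration of how the Exact and Partial percentages above could be computed, here is a minimal sketch. It assumes that Exact counts predictions identical to the ground truth and that Partial is a more lenient comparison, here taken to be case-insensitive. The thread does not state the definition used by the training script, and match_rates is a hypothetical helper, not a docTR function.

```python
from typing import List, Tuple


def match_rates(gts: List[str], preds: List[str]) -> Tuple[float, float]:
    """Return (exact, partial) match rates over paired ground truths and predictions.

    exact:   fraction of predictions identical to the ground truth.
    partial: fraction that match after case folding (assumed definition).
    """
    if len(gts) != len(preds):
        raise ValueError("gts and preds must have the same length")
    if not gts:
        raise ValueError("at least one sample is required")

    exact = sum(gt == pred for gt, pred in zip(gts, preds))
    partial = sum(gt.casefold() == pred.casefold() for gt, pred in zip(gts, preds))
    return exact / len(gts), partial / len(gts)


# Hypothetical word crops: one exact hit, one case-only mismatch, one miss.
exact, partial = match_rates(
    ["Invoice", "TOTAL", "42"],
    ["Invoice", "total", "47"],
)
print(f"Exact: {exact:.2%} | Partial: {partial:.2%}")
```

By construction the partial rate can never be lower than the exact rate, which is consistent with the numbers reported on MJSynth, FUNSD, and CORD.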

frgfm

Thanks Felix, I added some comments!

doctr/models/recognition/sar/tensorflow.py

frgfm · 2022-05-30T11:31:02Z

> > FYI: this PR introduces the deprecation of resnet31 as a pretrained model for TF
>
> @frgfm Yes, I know, but (correct me if I'm wrong) it is currently only used as the backbone for SAR (and MASTER, which will also change before the next release), so I would say let's include this in the next training iteration.
>
> Trained 1 epoch on an MJSynth subset: 500k train / 200k val
> Validation loss decreased inf --> 0.215485: saving state...
> Epoch 1/100 - Validation loss: 0.215485 (Exact: 72.99% | Partial: 77.53%)
>
> FUNSD:
> Validation loss: 1.89441 (Exact: 44.94% | Partial: 47.41%)
> CORD:
> Validation loss: 2.34539 (Exact: 36.30% | Partial: 36.77%)
>
> I would say that for a toy run this does not look bad at all, and we can close the SAR issue with this PR. WDYT? :)

I agree, that's quite good 👍

frgfm

Just a few typos left!

doctr/models/classification/resnet/tensorflow.py

doctr/models/recognition/sar/tensorflow.py

felixdittrich92 · 2022-05-30T13:06:14Z

@frgfm
Are we good ? :)

frgfm

Thanks Felix 🙏

felixdittrich92 self-assigned this May 20, 2022

felixdittrich92 added this to the 0.5.2 milestone May 20, 2022

felixdittrich92 added the module: models, framework: tensorflow, and topic: text recognition labels May 20, 2022

frgfm mentioned this pull request May 20, 2022

Cannot train pytorch sar_resnet31 and master recognition model #802

Closed

4 tasks

felixdittrich92 force-pushed the fix-sar-tf branch from 0517ff1 to b2510eb May 22, 2022 12:34

felixdittrich92 requested a review from frgfm May 23, 2022 08:33

felixdittrich92 commented May 23, 2022

doctr/models/recognition/sar/tensorflow.py (outdated)

felixdittrich92 changed the title ~~[WIP][Fix] TensorFlow SAR_Resnet31 implementation~~ [Fix] TensorFlow SAR_Resnet31 implementation May 23, 2022

felixdittrich92 marked this pull request as ready for review May 23, 2022 10:21

felixdittrich92 requested a review from charlesmindee May 24, 2022 10:32

felixdittrich92 mentioned this pull request May 24, 2022

Inconsistent references when loading the checkpoint #819

Closed

frgfm reviewed May 25, 2022

doctr/models/recognition/sar/pytorch.py (outdated)

doctr/models/recognition/sar/pytorch.py (outdated)

doctr/models/recognition/sar/tensorflow.py (outdated)

felixdittrich92 requested a review from frgfm May 25, 2022 21:39

felixdittrich92 closed this May 29, 2022

felixdittrich92 force-pushed the fix-sar-tf branch from 2516fd7 to 0c8dd60 May 29, 2022 11:36

reopen sar tf

a85ea51

felixdittrich92 reopened this May 29, 2022

remove path to pretrained resnet31

ec2c516

felixdittrich92 added the critical and type: breaking change labels May 30, 2022

frgfm reviewed May 30, 2022

apply suggestions

dcba713

felixdittrich92 requested a review from frgfm May 30, 2022 11:56

frgfm added the type: bug label May 30, 2022

frgfm requested changes May 30, 2022

revert url remove and apply suggestions

dc5a168

felixdittrich92 requested a review from frgfm May 30, 2022 12:10

add missing kwargs

19b287f

frgfm approved these changes May 31, 2022

frgfm merged commit 75531c5 into mindee:main May 31, 2022

felixdittrich92 deleted the fix-sar-tf branch May 31, 2022 10:38

felixdittrich92 modified the milestones: 0.5.2, 0.6.0 Sep 26, 2022

felixdittrich92 mentioned this pull request Sep 26, 2022

Release tracker - v0.6.0 #791

Closed

85 tasks
