feat: Added recognition postprocessor with CTC decoder #37

charlesmindee · 2021-01-25T11:13:27Z

Implements CTC decoder with keras backend to decode raw output of CRNN model.
postprocessor input: raw tensor (CRNN output), output: list of words (strings), size = batch_size

codecov · 2021-01-25T11:18:48Z

Codecov Report

Merging #37 (591c114) into main (05d6f4f) will decrease coverage by 0.29%.
The diff coverage is 94.59%.

@@            Coverage Diff             @@
##             main      #37      +/-   ##
==========================================
- Coverage   97.84%   97.54%   -0.30%     
==========================================
  Files          17       18       +1     
  Lines         371      408      +37     
==========================================
+ Hits          363      398      +35     
- Misses          8       10       +2

Impacted Files	Coverage Δ
doctr/models/recognition/postprocessor.py	`94.44% <94.44%> (ø)`
doctr/models/recognition/__init__.py	`100.00% <100.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 05d6f4f...591c114. Read the comment docs.

fg-mindee

Thanks for the PR! I left a few comments, but I'm wondering here: if this is specific to CRNN, then we should put specific parts into the crnn.py file. And I feel like your postprocessing function, could be a class (at init, we can already set num_classes, label_to_dict, ignore_case, ignore_accents, so that the call method only takes logits as inputs)

doctr/models/recognition/postprocessor.py

fg-mindee

Thanks for the edits! LGTM

charlesmindee added 8 commits January 21, 2021 17:25

add: crnn

f705f3d

draft crnn

b9ed0a8

conflicts: merged main

13f3f6a

feat: add vgg16 + crnn classes

00f779d

feat: ✨ added ctc decoder fn

40fec07

resolved conflicts

49f338b

add: test ctc decoder

b6385d3

resolving conflicts

7b864da

charlesmindee added the module: models Related to doctr.models label Jan 25, 2021

charlesmindee requested a review from fg-mindee January 25, 2021 11:13

charlesmindee self-assigned this Jan 25, 2021

flake8

ca35897

fg-mindee suggested changes Jan 25, 2021

View reviewed changes

doctr/models/recognition/postprocessor.py Outdated Show resolved Hide resolved

refacto: ♻️ CTCPostProcessor class

591c114

fg-mindee changed the title ~~Postprocess recognition : CTC decoder~~ feat: Added recognition postprocessor with CTC decoder Jan 25, 2021

fg-mindee added this to the 0.1.0 milestone Jan 25, 2021

fg-mindee approved these changes Jan 25, 2021

View reviewed changes

charlesmindee merged commit 3284dca into main Jan 25, 2021

charlesmindee deleted the postprocess_reco branch January 25, 2021 13:26

fg-mindee mentioned this pull request Jan 26, 2021

[models] Add text recognition module #4

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Added recognition postprocessor with CTC decoder #37

feat: Added recognition postprocessor with CTC decoder #37

charlesmindee commented Jan 25, 2021

codecov bot commented Jan 25, 2021 •

edited

Loading

fg-mindee left a comment

fg-mindee left a comment

feat: Added recognition postprocessor with CTC decoder #37

feat: Added recognition postprocessor with CTC decoder #37

Conversation

charlesmindee commented Jan 25, 2021

codecov bot commented Jan 25, 2021 • edited Loading

Codecov Report

fg-mindee left a comment

Choose a reason for hiding this comment

fg-mindee left a comment

Choose a reason for hiding this comment

codecov bot commented Jan 25, 2021 •

edited

Loading