[models] Add text recognition module #4

Closed
4 tasks done
fg-mindee opened this issue Jan 11, 2021 · 1 comment · Fixed by #39
Labels: module: models (Related to doctr.models)

Comments

fg-mindee commented Jan 11, 2021

Design a model component responsible for recognizing the text strings inside the regions of interest of an image.

Input

  • images: cropped images, already read and decoded into NumPy-style arrays, each expected to contain a single character sequence

Output

  • text: list of N strings, where N = number of cropped input images
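
As a rough illustration of the input/output contract above, here is a minimal sketch. The names `recognize_crops`, `read_text`, and `dummy_reader` are hypothetical and not part of doctr, and the (H, W, C) array layout is an assumption; the actual model is replaced by a placeholder callable.

```python
from typing import Callable, List, Sequence

import numpy as np


def recognize_crops(
    crops: Sequence[np.ndarray],
    read_text: Callable[[np.ndarray], str],
) -> List[str]:
    """Return one string per cropped image, in input order.

    `read_text` is a hypothetical stand-in for the recognition model.
    """
    texts = []
    for index, crop in enumerate(crops):
        # Assumption: each crop is an already-decoded (H, W, C) array.
        if not isinstance(crop, np.ndarray) or crop.ndim != 3:
            raise ValueError(f"crop {index} is not an (H, W, C) array")
        texts.append(read_text(crop))
    return texts


def dummy_reader(crop: np.ndarray) -> str:
    """Placeholder 'model': reports the crop size instead of reading text."""
    return f"{crop.shape[0]}x{crop.shape[1]}"


crops = [
    np.zeros((32, 128, 3), dtype=np.uint8),
    np.zeros((16, 64, 3), dtype=np.uint8),
]
texts = recognize_crops(crops, dummy_reader)
assert len(texts) == len(crops)  # N crops in, N strings out
```

The only property this sketch is meant to show is the one stated in the spec: N cropped images in, a list of N strings out, in the same order.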

The following components would be required:

fg-mindee added the labels type: enhancement, help wanted, and module: models on Jan 11, 2021
fg-mindee added this to the 0.1.0 milestone on Jan 11, 2021
fg-mindee removed the labels type: enhancement and help wanted on Jan 12, 2021
fg-mindee commented

EDIT: as discussed, the recognition module expects cropped images that each contain a single character sequence. A higher-level object will use the localization information from DetectionPredictor to crop the images and pass the crops to the RecognitionPredictor.
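
The division of work described in this comment could look roughly like the sketch below. It is an assumption-laden illustration, not doctr's implementation: `crop_regions` and `read_page` are hypothetical names, the `detect` and `recognize` callables merely stand in for DetectionPredictor and RecognitionPredictor, and boxes are assumed to be absolute pixel coordinates `(xmin, ymin, xmax, ymax)`.

```python
from typing import Callable, List, Sequence, Tuple

import numpy as np

# Assumption: a box is (xmin, ymin, xmax, ymax) in absolute pixels.
Box = Tuple[int, int, int, int]


def crop_regions(image: np.ndarray, boxes: Sequence[Box]) -> List[np.ndarray]:
    """Cut one sub-image per box out of an (H, W, C) image."""
    height, width = image.shape[:2]
    crops = []
    for xmin, ymin, xmax, ymax in boxes:
        # Clip the box to the image bounds.
        xmin, xmax = max(0, xmin), min(width, xmax)
        ymin, ymax = max(0, ymin), min(height, ymax)
        # Skip boxes that are empty after clipping. A real pipeline would
        # need to keep track of which boxes were dropped.
        if xmax > xmin and ymax > ymin:
            crops.append(image[ymin:ymax, xmin:xmax])
    return crops


def read_page(
    image: np.ndarray,
    detect: Callable[[np.ndarray], Sequence[Box]],
    recognize: Callable[[Sequence[np.ndarray]], List[str]],
) -> List[str]:
    """Higher-level object: detect boxes, crop, then recognize the crops."""
    boxes = detect(image)  # stand-in for the detection step
    crops = crop_regions(image, boxes)
    return recognize(crops)  # stand-in for the recognition step
```

Keeping the cropping in the higher-level function means the recognition step only ever sees single-sequence crops, which matches the input contract in the issue description.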
