Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[TODO] create_features does not detect rotated images leading to no extractions #15

Open
shabie opened this issue Nov 10, 2021 · 6 comments
Assignees

Comments

@shabie
Copy link
Owner

shabie commented Nov 10, 2021

TODO

I specially like this answer with tesserocr (faster than pytesseract): https://stackoverflow.com/a/69131832/7996306

@uakarsh
Copy link
Collaborator

uakarsh commented Nov 11, 2021

Okay, I would try to include that, instead of pytesseract. Just a side note, did you add the DocFormer implementation, in the paperswithcode.com :)

@uakarsh uakarsh self-assigned this Nov 11, 2021
@uakarsh
Copy link
Collaborator

uakarsh commented Nov 19, 2021

I am trying it, however I face this error https://githubmemory.com/repo/madmaze/pytesseract/issues/368, and I am unable to fix it right now.

@uakarsh
Copy link
Collaborator

uakarsh commented Nov 19, 2021

I am trying it, however, I face this error https://githubmemory.com/repo/madmaze/pytesseract/issues/368, and I am unable to fix it right now.

I would try to fix it in coming days

@shabie
Copy link
Owner Author

shabie commented Nov 19, 2021 via email

@shabie
Copy link
Owner Author

shabie commented Nov 29, 2021

So as an update, despite my best attempts I haven't manage to get the OCR any bit faster...

@uakarsh
Copy link
Collaborator

uakarsh commented Dec 1, 2021

So, uptil then, let us try to work with our old conventional method

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants