New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Optical character recognition (OCR) of text in freeform fields #15
Comments
Oh, I somehow expected this to come up at some point ... thanks for pointing out gamera, I did not know about it. Some thoughts of what needs to be done for this:
Gamera seems like a good starting point, but this whole thing is quite a big chunk to get it working. I have no idea when or even how much time I can spend on this myself ... |
Step 1. is done. There is now an "ocr" branch, that has support for rendering text fields that are optimized for OCR. Obviously the rest of the program does not understand this yet. |
Cool! Sent from my iPhone On Feb 23, 2013, at 10:09, Benjamin Berg notifications@github.com wrote:
|
I did some work on the branch to interface with gamera. I don't seem to entirely understand it right now, but there is hope :-) It seems to me that the default grouping doesn't work, but SDAPS can do grouping by itself. Also, for a start it seems easier to do the training using the gamera_gui program instead of something custom. Important next steps:
Fun side fact: gamera seems to store the image+original location into the XML file for each character; I guess some munging could be necessary for privacy reasons if one wants to share the training data. Otherwise the original strings could be build from the training data. |
Hi, |
Unless something unexpected happens, it is unlikely that I will work on this anytime soon. So, I wouldn't hold my breath for this to happen. |
Unexpected would for likely mean some external code contribution, or if someone really wants this paid development work (either by me or a third party). |
Hi, considering paid development is an option, is it possible for you (or a third party) to estimate remaining work to achieve this feature ? Thank you. |
Hi, It is hard to say how much work it is overall. I expect that there is a You might want also want to talk to Matthew Roy, see I'll think about the matter some more the next days/week (i.e. how much Benjamin |
Hi, I will be very interested by your evaluation, please keep me informed. SDAPS seems to be a very good start point to build a complete solution to meet needs of one of my customers but I absolutely need OCR in addition of OMR to be complete. Another option could be to take over ocr branch code to try to achieve it but certainly not the best in time and money terms in my context. Thank you. |
Is there any light in a tunnel? |
Is there any light in a tunnel?
Not really. Unless there is a ready to use tool that could be integrated. It feels like it shouldn't be that hard these days with all the neural network advancements to build something basic at least ...
But well, it would need someone doing the work. I can help with integration, but not much more, I think.
|
I'd like to see an enhancement added to the SDAPS project that would allow optical character recognition (OCR) of the text in freeform fields.
The Gamera Project may be a good place to start:
http://gamera.informatik.hsnr.de/
The text was updated successfully, but these errors were encountered: