Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow other OCR engines for adding recipes via the Scan feature #359

Open
mblennegard opened this issue Jun 13, 2024 · 2 comments
Open

Allow other OCR engines for adding recipes via the Scan feature #359

mblennegard opened this issue Jun 13, 2024 · 2 comments
Labels
enhancement New feature or request frontend Go HTML templates go Pull requests that update Go code
Milestone

Comments

@mblennegard
Copy link
Contributor

mblennegard commented Jun 13, 2024

Is your feature request related to a problem? Please describe.
Instead of having to rely on a cloud service, e.g. using Azure AI Document Intelligence in the current state, it would be very neat being able to provide other/additional OCR engines.
For instance if wanting to keep everything hosted locally (e.g. https://github.com/ocrmypdf/OCRmyPDF seems like a nice option), or if the results are bad from one OCR engine then another could be used for a particular recipe upload.

Describe the solution you'd like
Ideally being able to add multiple OCR engines, which can then be chosen from upon upload.
After upload completes, re-send the image to a different OCR engine if the results are not good from the first attempt.

Describe alternatives you've considered
How does the Scan feature actually work, e.g. if I have already manually OCR:ed an image (I have done this a lot over the years scanning entire cookbooks which I then run OCR for to make them searchable)?
Is the interpretation to a recipe schema happening inside Recipya or within Azure AI Document Intelligence? If it is happening inside Recipya then this is of course a much easier thing to implement.

@mblennegard mblennegard added the enhancement New feature or request label Jun 13, 2024
@reaper47 reaper47 added go Pull requests that update Go code frontend Go HTML templates labels Jun 13, 2024
@reaper47 reaper47 added this to the v1.3.0 milestone Jun 13, 2024
@reaper47
Copy link
Owner

reaper47 commented Jun 13, 2024

The interpretation to the recipe schema is done by parsing what is returned by the Azure API: https://github.com/reaper47/recipya/blob/main/internal%2Fmodels%2Focr.go#L87

Thank you for linking this package. I will think about how to approach this once v1.2.0 stable is released.

@gedw99
Copy link
Contributor

gedw99 commented Jun 19, 2024

https://github.com/tiagomelo/go-ocr is simple , maybe too simple.

it calls tesseract via an exe call, so no cgo bindings.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request frontend Go HTML templates go Pull requests that update Go code
Projects
Status: Backlog
Development

No branches or pull requests

3 participants