Skip to content

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

does this work with opencv? #67

Closed
Raj2032 opened this issue May 8, 2024 · 5 comments
Closed

does this work with opencv? #67

Raj2032 opened this issue May 8, 2024 · 5 comments

Comments

@Raj2032
Copy link

Raj2032 commented May 8, 2024

can this work with opencv, the idea is that have a camera and in real time it can detect words by in real time covnerting words into text?

@robertknight
Copy link
Owner

robertknight commented May 8, 2024

I haven't tried using this with OpenCV, but in principle it should be possible. Inference speed will depend a lot on the hardware you run it on. On my 4-year old Intel laptop it takes about 1s end-to-end for a typical image. If using the Rust library you can run just the text detection, which will be quicker (~200ms) and then run recognition only if something is seen.

@Raj2032
Copy link
Author

Raj2032 commented May 8, 2024 via email

@robertknight
Copy link
Owner

Sorry I didn't quite understand, are you saying don't run it as a software, but instead use it as a library and incorporate it into my opencv code?

You can do either. The command-line tool has the advantage you can call it from any tool which can run a program. The library gives you more flexibility to eg. run only detection, or process a batch of images at once etc.

@Raj2032
Copy link
Author

Raj2032 commented May 9, 2024

@robertknight Hey mate, so what is meant by "run only detection"? What else does the program do?

@robertknight
Copy link
Owner

If you run the ocrs program normally on an image, the output is the text that was found. Internally it uses a three-step process where it first detects which pixels in the input are text, then attempts to determine how the text pixels should be grouped into words and lines, then attempts to read the text from each line. If you are using the library, you can run each of these steps independently. So if you only need to know whether an image contains text, but not what the text is, you can run just the first step.

Repository owner locked and limited conversation to collaborators May 10, 2024
@robertknight robertknight converted this issue into discussion #69 May 10, 2024

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

Projects
None yet
Development

No branches or pull requests

2 participants