GSSoC'24: OCR Detection (Image to Text) #56

Closed
SAM-DEV007 opened this issue May 14, 2024 · 3 comments · Fixed by #62
Labels: enhancement, good first issue, gssoc, level3

Comments

@SAM-DEV007
Contributor

Is your feature request related to a problem? Please describe.
This feature would extend the repository's existing predictions of danger and suicidal thoughts. When a victim is in danger and unable to speak, they can hold written text up to the camera; the model can then run its predictions on that text and notify nearby users accordingly. For suicide prediction, it helps in situations where the victim is writing a suicide note: if the camera catches a glimpse of the note, the model can run its predictions on the text it reads.

Describe the solution you'd like
The sole purpose is to provide image-to-text conversion. For visualization, each detected text region in the frame will be surrounded by a bounding box and labeled with the detected text and its confidence, provided the confidence surpasses a certain threshold. A pretrained model from easyocr will be used along with cv2.
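
A minimal sketch of the approach described above, not the code from the linked pull request. It assumes easyocr's `Reader.readtext`, which returns `(bounding_box, text, confidence)` tuples with four corner points per box. The helper names, the 0.5 threshold, and the drawing colors are illustrative assumptions; the issue does not specify them.

```python
from typing import List, Sequence, Tuple

# One easyocr result: (four [x, y] corner points, detected text, confidence).
Detection = Tuple[Sequence[Sequence[float]], str, float]

CONFIDENCE_THRESHOLD = 0.5  # assumed value; the issue only says "a certain threshold"


def filter_detections(
    detections: Sequence[Detection], threshold: float = CONFIDENCE_THRESHOLD
) -> List[Detection]:
    """Keep only detections whose confidence surpasses the threshold."""
    return [d for d in detections if d[2] > threshold]


def box_corners(bbox: Sequence[Sequence[float]]) -> Tuple[Tuple[int, int], Tuple[int, int]]:
    """Reduce four corner points to integer (top-left, bottom-right) corners."""
    xs = [int(point[0]) for point in bbox]
    ys = [int(point[1]) for point in bbox]
    return (min(xs), min(ys)), (max(xs), max(ys))


def annotate_frame(frame, reader, threshold: float = CONFIDENCE_THRESHOLD):
    """Draw a box and a "text (confidence)" label for each confident detection.

    `frame` is a BGR image array and `reader` is an easyocr.Reader instance.
    """
    import cv2  # imported here so the helpers above work without OpenCV installed

    green = (0, 255, 0)
    for bbox, text, confidence in filter_detections(reader.readtext(frame), threshold):
        top_left, bottom_right = box_corners(bbox)
        cv2.rectangle(frame, top_left, bottom_right, green, 2)
        label_origin = (top_left[0], max(top_left[1] - 10, 0))
        cv2.putText(frame, f"{text} ({confidence:.2f})", label_origin,
                    cv2.FONT_HERSHEY_SIMPLEX, 0.6, green, 2)
    return frame
```

To try it on a webcam frame, one would create `reader = easyocr.Reader(['en'])` once (the first call downloads the pretrained model), read a frame with `cv2.VideoCapture(0).read()`, and pass both to `annotate_frame`.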

Use Case
The extracted text can be used with the existing models in the repository that take text as their primary input for various predictions.

Please assign it to me under GSSoC'24.

@TAHIR0110
Owner

The use case you've described raises an important point. If a person is able to open the camera, they might also be able to reach an SOS button or emergency feature directly. In such cases, direct access to emergency features would likely be more efficient and reliable than relying on image-to-text conversion to predict danger. However, there might be some cases where this would be useful, so I will still assign this issue.

TAHIR0110 added the gssoc, level1, enhancement, and good first issue labels on May 14, 2024
@SAM-DEV007
Contributor Author

SAM-DEV007 commented May 14, 2024

@TAHIR0110 A pull request has been created.

@ananyabansal16

If this issue is still pending, I can take it up under GSSoC'24. Let me know. :)
