New Feature to add OCR Support #500

Closed
jasondavis opened this issue Feb 11, 2015 · 11 comments

Comments

@jasondavis

It would be cool to see a way to integrate the command-line tool Tesseract OCR: upload an image with text on it, have ShareX run the image through Tesseract OCR, and then upload both the image and the text file that Tesseract creates containing the text it detected in the image.
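
A minimal sketch of the Tesseract step in that workflow, assuming the `tesseract` executable is installed and on the PATH. The function names `tesseract_command` and `ocr_image` are hypothetical and not part of ShareX; the sketch relies only on the fact that the Tesseract CLI takes an input image and an output base name and writes the text to `<base>.txt`.

```python
import shutil
import subprocess
from pathlib import Path
from typing import List, Tuple


def tesseract_command(image_path: str, lang: str = "eng") -> Tuple[List[str], Path]:
    """Build the Tesseract CLI call and the path of the text file it will write."""
    image = Path(image_path)
    # Tesseract expects an output *base* name and appends ".txt" itself.
    output_base = image.with_suffix("")
    command = ["tesseract", str(image), str(output_base), "-l", lang]
    return command, Path(str(output_base) + ".txt")


def ocr_image(image_path: str, lang: str = "eng") -> Path:
    """Run Tesseract on an image and return the path of the generated text file."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    command, text_path = tesseract_command(image_path, lang)
    subprocess.run(command, check=True)  # raises CalledProcessError on failure
    return text_path
```

An uploader could then send both the original image and the file returned by `ocr_image`, which is the two-file upload described above.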

@Jaex
Member

Jaex commented Feb 13, 2015

Producing text output while processing an image is a complex workflow, and I can't think of an easy solution for it. There is probably plenty of OCR software already, so it would be overkill to use ShareX for this anyway.

http://www.makeuseof.com/tag/5-best-ocr-tools-extracting-text-images/

@Jaex Jaex closed this as completed Feb 13, 2015
@jasondavis
Author

Understood. I just wanted to share Tesseract OCR, which I see is being integrated into some other projects on GitHub right now, so I thought it might be a cool add-on... In my opinion ShareX is absolutely perfect as it is right now! I have been using it for 5-6 years now, and it is a piece of software that I use almost every single day! So again, thank you @Jaex for keeping the project alive these past years and sharing it with the world!

Every time I see an update, I am amazed at how the software has evolved to be more awesome over and over again! With that said, I can't wait to see what else you come up with in the future. At the same time, I don't want the project to stray too far from its main purpose and become some bloated piece of crap! I am confident that you will not allow that to happen, though, as you have only added awesome and useful features so far, so thanks again and I look forward to the future of ShareX!

BTW, I am constantly promoting the app to other people, as I feel it really is a hidden gem! So many people would love this app if they just knew it existed and, more importantly, knew just how powerful it is and how many features it has! It's hard to teach people about all the good features, though, as people are lazy and won't take the time to discover and learn things... Either way, it's their loss! I will keep spreading the word, though!

  • Jason
  • Twitter - @JasonDavisFL

@MzHub

MzHub commented Jan 13, 2016

I was about to suggest this myself, so I'll at least give my 2 cents.

@Jaex, while I agree that OCR can get complex, I think you may be thinking of a different use case than what was intended here. OCR of photos and scans is a whole different world, but a much simpler use case - and in my opinion more useful for a tool like ShareX - is copying text from screenshots or other images to the clipboard.

I used to have an application for exactly that use case years ago, and I still miss it often. It could not read anything complex, but it handled normal fonts better than OCR software meant for photos. I would love to have that in ShareX behind another screenshot hotkey, as I'm already used to how ShareX works. I would rather not get another application for that.

I disagree that this functionality would be overkill, or out of the scope of ShareX, since there is already text and clipboard handling and uploading. To me it would make sense for ShareX to combine screenshot + text via OCR. As jasondavis said, Tesseract may be one option to look into.

However, I understand if this is not a high priority :)

EDIT: Another crazy idea: let the user select an endpoint to upload the image to, and use the resulting text as output (to clipboard?). Something like that. The OCR itself could then be offloaded to an online OCR app.
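
A rough sketch of the endpoint idea, under loudly stated assumptions: the endpoint URL, the raw-bytes upload format, and the `{"text": "..."}` reply shape are all hypothetical, and a real OCR service will define its own request and response format. The function names are illustrative only.

```python
import json
import urllib.request


def build_ocr_request(endpoint_url: str, image_bytes: bytes) -> urllib.request.Request:
    """Create a POST request carrying the raw image bytes (assumed upload format)."""
    return urllib.request.Request(
        endpoint_url,
        data=image_bytes,
        headers={"Content-Type": "application/octet-stream"},
        method="POST",
    )


def extract_text(response_body: bytes) -> str:
    """Read the recognized text from a JSON reply of the assumed form {"text": "..."}."""
    payload = json.loads(response_body.decode("utf-8"))
    return payload.get("text", "").strip()


def ocr_via_endpoint(endpoint_url: str, image_bytes: bytes, timeout: float = 30.0) -> str:
    """Upload the image to the user-selected endpoint and return the resulting text."""
    request = build_ocr_request(endpoint_url, image_bytes)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return extract_text(response.read())
```

The string returned by `ocr_via_endpoint` is what would go to the clipboard in place of the usual upload URL.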

@MzHub

MzHub commented Jan 18, 2016

Here is another option, as used by Greenshot:
http://getgreenshot.org/faq/how-can-i-use-greenshot-for-ocr/

For the Greenshot OCR (Optical Character Recognition) plug-in to work we are depending on a component from Microsoft called MODI (Microsoft Office Document Imaging). If this is installed Greenshot will activate the plugin (if installed) and you will have an OCR destination. If you select this destination the OCR logic from Microsoft will try to detect the text on the screenshot and Greenshot will place this on the clipboard. MODI can be installed as an option from the Microsoft Office 2003 / 2007 installer or as an option from the free Microsoft SharePoint Designer 2007 ( you can find a description and a link to download it under “method 2″ here: http://support.microsoft.com/kb/982760)

@MzHub

MzHub commented Feb 22, 2016

@Jaex on the OCR endpoint idea, here are some free solutions that some users might be able to use:

Microsoft's Computer Vision API - https://www.projectoxford.ai/pricing - free quota 5,000/month
Google's Cloud Vision API - https://cloud.google.com/vision/pricing - free quota 1,000/month
HP's OCR API - https://dev.havenondemand.com/apis/ocrdocument - free quota ?

@angularsen

This would be great as a plugin to ShareX, or simply included out of the box.
My use case is using ShareX to select a part of the screen and then copy the parsed text to the clipboard.

@angularsen

Update, I just tried Capture2Text from http://www.makeuseof.com/tag/5-best-ocr-tools-extracting-text-images/ .

It works pretty much like I want it to: a hotkey starts a rectangle selection, you move the mouse, and a mouse click ends the selection. The parsed text is then copied to the clipboard. It would be awesome to have something similar in ShareX, but obviously there are working alternatives out there.

@Jaex
Member

Jaex commented May 13, 2016

I added OCR as "Text capture" in the capture menu.

It uses an online API to do the OCR and then shows the result.

Using an online API has the advantage of not having to include a very large library and language files in the ShareX setup.

@trickbz

trickbz commented Jun 10, 2016

Really cool feature, thanks!

But I have several suggestions:

  1. The "Optical character recognition" window opens after each capture, and the user is forced to select the text and copy it to the clipboard. Would it be possible to add an option to just copy the recognized text to the clipboard (with a predefined language in the settings) and continue the workflow?
  2. Ctrl + A doesn't work to select all text in the "Optical character recognition" window.

Thanks in advance!

@GnstheGrain

GnstheGrain commented Jan 23, 2018

@Jaex

#500 (comment)

May I ask which service provider's API is being used?

Also, just a short warning to ShareX users using this OCR feature. While I do use it myself, and totally appreciate its integration into ShareX, please always consider that everything you scan using this API is sent directly to the OCR service provider. (So at the very least, avoid scanning passwords, passphrases, sensitive images, wallet addresses, QR codes, private keys, etc.)

@Jaex : Would you know of any "on-premise" or "self-hosted" alternative we could use instead of the third-party API for this? It would be a great privacy enhancement.

@Jaex
Member

Jaex commented Jan 24, 2018

https://ocr.space


6 participants