New Feature to add OCR Support #500

jasondavis · 2015-02-11T09:57:43Z

Would be cool to see a way to integrate the command line tool Tesseract OCR to be able to upload an image with text on it and have ShareX run the image through Tesseract OCR and then upload the image and then upload the text file that Tesseract OCR creates containing the text it read and detected from the image.

Jaex · 2015-02-13T07:52:48Z

While processing image having text output is complex workflow and I can't think easy solution for it. Probably there is many OCR software so it will be overkill to use ShareX for it anyway.

http://www.makeuseof.com/tag/5-best-ocr-tools-extracting-text-images/

jasondavis · 2015-02-14T19:17:59Z

understand, just wanted to share the Tesseract OCR that I see is being integrated into some other projects on GitHub right now so thought it might be a cool addon...in my opinion ShareX is absolutely perfect as it is right now! I have been using it for almost 5-6 years now and it is a piece of software that I use almost every single day! So again thank you @Jaex for keeping the project alive these past years and sharing it with the world!

Everytime I see an update, I am just amazed at the software has evolved to be more awesome over and over again! With that said, I can't wait to see what else you come up with in the future. At the same time, I don't want the project to shed away to far from it;s main purpose and become some bloated piece of crap! I am confidant that you will not allow that to happen though as you have only added awesome and useful features so far, so thanks again and look forward to the future of ShareX!

BTW I am constantly promoting the app to other people as I feel like it really is a hidden Gem! So many people would love this app if they just knew it existed and more importantly knew just how powerful and how many feature it has! It;s hard to teach people of all the good features though as people are lazy and wont take the time to discover and learn things...either way it;s there loss! I will keep spreading the word though!

Jason
Twitter - @JasonDavisFL

MzHub · 2016-01-13T12:19:57Z

I was about to suggest this myself, so I'll at least give my 2 cents.

@Jaex, while I agree that OCR can get complex, I think you may be thinking of a different use case than what was intended here. OCR of photos and scans is a whole different world, but a much simpler use case - and in my opinion more useful for a tool like ShareX - is copying text from screenshots or other images to the clipboard.

I used to have an application for exactly that use case years ago, and I still miss it often. It could not read anything complex, but it handled normal fonts better than OCR software meant for photos. I would love to have that in ShareX behind another screenshot hotkey, as I'm already used to how ShareX works. I would rather not get another application for that.

I disagree that this functionality would be overkill, or out of the scope of ShareX, since there is already text and clipboard handling and uploading. To me it would make sense for ShareX to combine screenshot + text via OCR. As jasondavis said, Tesseract may be one option to look into.

However, I understand if this is not a high priority :)

EDIT: Another crazy idea solution could be to let the user select an endpoint to upload the image to, and use the resulting text as output (to clipboard?). Something like that. Could be offloaded into an OCR app online.

MzHub · 2016-01-18T07:40:41Z

Here is another option, as used by Greenshot:
http://getgreenshot.org/faq/how-can-i-use-greenshot-for-ocr/

For the Greenshot OCR (Optical Character Recognition) plug-in to work we are depending on a component from Microsoft called MODI (Microsoft Office Document Imaging). If this is installed Greenshot will activate the plugin (if installed) and you will have an OCR destination. If you select this destination the OCR logic from Microsoft will try to detect the text on the screenshot and Greenshot will place this on the clipboard. MODI can be installed as an option from the Microsoft Office 2003 / 2007 installer or as an option from the free Microsoft SharePoint Designer 2007 ( you can find a description and a link to download it under “method 2″ here: http://support.microsoft.com/kb/982760)

MzHub · 2016-02-22T06:22:17Z

@Jaex on the OCR endpoint idea, here are some free solutions that some users might be able to use:

Microsoft's Computer Vision API - https://www.projectoxford.ai/pricing - free quota 5,000/month
Google's Cloud Vision API - https://cloud.google.com/vision/pricing - free quota 1,000/month
HP's OCR API - https://dev.havenondemand.com/apis/ocrdocument - free quota ?

angularsen · 2016-04-02T23:15:07Z

This would be great as a plugin to ShareX, or simply included out of the box.
My usecase is using ShareX to select a part of the screen, then copy the parsed text to the clipboard.

angularsen · 2016-04-02T23:24:51Z

Update, I just tried Capture2Text from http://www.makeuseof.com/tag/5-best-ocr-tools-extracting-text-images/ .

I works pretty much like I want to, hotkey to start a rectangle selection, move mouse then mouse click to end rectangle selection. Parsed text is then copied to clipboard. It would be awesome to have something similar in ShareX, but obviously there are working alternatives out there.

Jaex · 2016-05-13T22:40:26Z

I added OCR as "Text capture" in capture menu:

It uses online API to do OCR and shows result like this:

Using online API had advantage of not including very high size library and language files to ShareX setup.

trickbz · 2016-06-10T09:37:11Z

Really cool feature, thanks!

But several suggestions:

"Optical character recognition" window opened after each capture, and user forced to select a text and paste to clipboard. Is it possible to add additional option to just copy recognized text to clipboard (with predefined language in the settings) and continue workflow?
Ctrl + A doesn't work to select all text in "Optical character recognition" window

Thanks in advance!

GnstheGrain · 2018-01-23T17:52:30Z

@Jaex

#500 (comment)

May I ask what service provider the api is being used from ?

Also, just a short warning @ ShareX's users using this OCR feature. While I do use it myself, and totally appreciate it's integration to ShareX, please always consider that everything you scan using this API is sent directly to the OCR service provider. (so at the very least, avoid scanning password,passphrase,sensitive image, wallet address, qrcode, private key, etc...)

@Jaex : Would you know of any "on-premise" or "self-hosted" alternative we could use instead of the third-party's API for this ? Would be a great privacy enhancement.

Jaex · 2018-01-24T00:54:04Z

https://ocr.space

Jaex added the Enhancement label Feb 11, 2015

Jaex closed this as completed Feb 13, 2015

vatterspun mentioned this issue Sep 11, 2019

Add a simple offline OCR #4269

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New Feature to add OCR Support #500

New Feature to add OCR Support #500

jasondavis commented Feb 11, 2015

Jaex commented Feb 13, 2015

jasondavis commented Feb 14, 2015

MzHub commented Jan 13, 2016

MzHub commented Jan 18, 2016

MzHub commented Feb 22, 2016

angularsen commented Apr 2, 2016

angularsen commented Apr 2, 2016

Jaex commented May 13, 2016 •

edited

trickbz commented Jun 10, 2016 •

edited

GnstheGrain commented Jan 23, 2018 •

edited

Jaex commented Jan 24, 2018

New Feature to add OCR Support #500

New Feature to add OCR Support #500

Comments

jasondavis commented Feb 11, 2015

Jaex commented Feb 13, 2015

jasondavis commented Feb 14, 2015

MzHub commented Jan 13, 2016

MzHub commented Jan 18, 2016

MzHub commented Feb 22, 2016

angularsen commented Apr 2, 2016

angularsen commented Apr 2, 2016

Jaex commented May 13, 2016 • edited

trickbz commented Jun 10, 2016 • edited

GnstheGrain commented Jan 23, 2018 • edited

Jaex commented Jan 24, 2018

Jaex commented May 13, 2016 •

edited

trickbz commented Jun 10, 2016 •

edited

GnstheGrain commented Jan 23, 2018 •

edited