[New PowerToy] Create new OCR PowerToy #19172
What do you mean by MVP? Do you mean this PR is a WIP? |
@Aaron-Junker by MVP I meant Minimum Viable Product: the region-selection OCR fully works as it stands. If anyone wants to pull my branch and build it, it will work for them. I will edit the original comment to be clearer. This is still a WIP, and I will keep working to get it where it needs to be for full inclusion in PowerToys. |
Oh. Thank you for the explanation. I thought maybe you mixed it up. |
Please don't forget to add this to the issue templates. |
Just did. Going with minimum lovable product here. What's remaining in terms of features would go in different issues. |
@jaimecbernardo |
Was about to add a comment about that. |
Everything works nicely on single and multi monitor setup. Nice work!
If we add the context menu for images, then the source is an image and not the screen. |
I am on board with changing the name away from PowerOCR. "OCR Copy" is a good name. When thinking up a name I did web searches and found that many of the obvious ones ("Easy OCR", "Quick OCR", "Fast OCR", etc.) were already in use. |
So the current plan is to merge as "PowerOCR" and change the name later if needed. |
@jaimecbernardo |
Win+Shift+O is used by VCM already for camera muting. |
Although, Win+T and Win+Shift+T are shortcuts for cycling forward and backwards on the taskbar icons. Going to check if it can be overridden on Windows 11 as well and then apply the change. |
Tested locally, got a crash once during OCR:
|
@yuyoyuppe We should probably wrap the Clipboard access in a try block
and probably log an error if access to the clipboard fails. |
How does Color Picker safeguard this? |
@crutkas try-catch with log. @TheJoeFin I already love this tool! 😃
|
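The try-catch-with-log guard suggested above can be sketched as follows. This is a language-neutral illustration in Python, not the PowerToys code: `try_set_clipboard` and the injected `set_text` callable are hypothetical names, and the sketch simply assumes that writing to the clipboard can raise (for example when another process is holding it).

```python
import logging
from typing import Callable

logger = logging.getLogger("clipboard_sketch")


def try_set_clipboard(text: str, set_text: Callable[[str], None]) -> bool:
    """Try to place text on the clipboard via the supplied setter.

    `set_text` is a hypothetical stand-in for whatever platform call
    actually writes to the clipboard. On failure the error is logged
    and False is returned instead of letting the exception crash the app.
    """
    try:
        set_text(text)
        return True
    except Exception as exc:  # assumption: any failure here is non-fatal
        logger.error("Clipboard access failed: %s", exc)
        return False


if __name__ == "__main__":
    captured = []
    # A list's append method stands in for a working clipboard setter.
    print(try_set_clipboard("recognized text", captured.append))
```

Passing the setter in as a parameter keeps the guard in one place, so a shared helper of this shape could be reused by any module that writes to the clipboard.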
Would love the ability to run OCR in bulk on a folder of pictures, and have the ability to search images by text it contains. Probably out of the scope of this but would be incredibly handy for bulk scanning or finding things in a screenshots folder |
We should move the clipboard logic to a common lib so we reuse the same logic. |
@astandarduser please create a new issue as a "new feature" request. I would love to understand more of the "why", but it is a very interesting concept. |
This is SOO nice. I did use Text Grab but having this as part of PowerToys will make it something I know will just always be there for me. Nits:
Thank you for making the "dimming" be instant and not fade. If you do change to fade, make that an optional thing please. @crutkas, should the above go in a new issue at this point also? |
Summary of the Pull Request
This PR introduces a new PowerToy to perform OCR anywhere on screen by selecting a rectangular region, clicking a word, or right-clicking an image file and selecting PowerOCR.
As of the first week in July, there is much more that needs to be done to make this PowerToy ready for prime time, but this is the MVP if anyone wants to pull and build just this tool. Most of the code is copied from my repository, Text Grab.
PR Checklist
Detailed Description of the Pull Request / Additional comments
Validation Steps Performed