Skip to content

PyWizards/TextExtract

Repository files navigation

forthebadge made-with-python

TextExtract :

Open Source Love svg1 License: MIT Downloads Python 3.6 PyPI version Python package PyWizards

Overview :

TextExtract is a optimize version of Tesseract to provide better text recognition.

Features :

  1. Extract text from any image
  2. Extract text which match expected text

Installation :

Prerequisite :

  • Python 3.6+ (https://www.python.org/downloads/)
  • Tesseract.exe file and DLLs('liblept177.dll','libtesseract400.dll','vcomp140.dll')
  • Tesseract version used is : tesseract 4.0.0-beta.3

Using pip :

pip install TextExtract

Usage :

from TextExtractApi.TextExtract import TextExtractFunctions  
  
#Get single text without comparing text with expected  
result,scale=TextExtractFunctions.image_to_string_only("MenuIcon.png",lang='eng')
  
#Get texts from list of scaled images text with highest matching with expected text
result,match=TextExtractFunctions.image_to_string_matched("MenuIcon.png",expected_text='Menu',lang='eng')
  
#Get list of result with matches results  
finalresult,scaled_results=TextExtractFunctions.image_to_string_matched("MenuIcon.png",expected_text='Menu',all_results=True)

Contact Information :

Email: sayarmendis26@gmail.com

Donation :

If you have found my softwares to be of any use to you, do consider helping me pay my internet bills. This would encourage me to create many such softwares :)

Donate via Instamojo

About

TextExtract is a optimize version of Tesseract to provide better text recognition.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages