Skip to content

This project demonstrates the use Boto3, an AWS SDK for Python, along with Textract, a machine learning service provided by AWS, to extract text from Images.

License

Notifications You must be signed in to change notification settings

Mahitej28/Textract101

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Textract101

This project demonstrates the use Boto3, an AWS SDK for Python, along with Textract, a machine learning service provided by AWS, to extract text from Images.

🌟 Features

  • Uses Boto3 to interact with the AWS Textract service.
  • Extracts text from Images in various formats, such as jpg or png.
  • Supports processing documents stored in AWS S3 buckets or locally on the file system.
  • Provides a simple and straightforward interface for extracting text from scanned documents.

⚙️ Installation

  • Clone the repository:
git clone <repository_url>
  • Install all the required dependencies
pip install -r requirement.txt
  • Run the python Application:
python extract_text.py

Note: You need to configure your AWS Profile with the project before running.

👀 Screenshots

Screenshot

Screenshot 2024-04-12 152942

✍ Author

📌 Contributions

Contributions are welcome! If you find any bugs or have ideas for improvements, feel free to open an issue or submit a pull request.

🛡 License

You may use this project freely at your own risk. See LICENSE.

Copyright (c) 2023 Mahima Churi

Connect with me

LinkedIn     Instagram     Gmail   

(Back to top)

About

This project demonstrates the use Boto3, an AWS SDK for Python, along with Textract, a machine learning service provided by AWS, to extract text from Images.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages