GitHub - alexdali/telethon_bot: Telegram bot that recognizes text in images and PDF files

OCR Bot for Telegram

🤖 Telegram bot that recognizes text in images and PDF files
Use it live here: https://t.me/text_from_image_bot

📝 Table of Contents

About
Demo / Working
How it works
Usage
Roadmap
Built Using
Authors
License
Acknowledgements

🧐 About

A simple and convenient Telegram bot that extracts text from images or PDF files that the user uploads to it. The recognition result can be delivered in one of two ways: as a message or as a text file.

🎥 Demo / Working

OCR in PDF files: pdf-ocr.mp4
OCR in image files: image-ocr.mp4

💭 How it works

The bot uses the Telegram API to communicate with the user. After the user starts the bot with the "/start" command, it is ready to accept a file for processing.

The user is provided with default text recognition settings: the text language is English, the content format is plain text, and the recognition result is displayed as a message. These settings can be changed using the inline menu buttons. Since the bot uses a free text recognition service, there are restrictions, which can be viewed by clicking the "Limits" button.
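The defaults described above could be modelled roughly as follows. This is a minimal sketch, not the bot's actual code: the class `OcrSettings`, the functions `get_settings` and `update_settings`, and the in-memory dictionary are all hypothetical names introduced for illustration, and `"eng"` is assumed to be the language code that OCR.space expects for English.

```python
from dataclasses import dataclass, replace
from typing import Dict


@dataclass(frozen=True)
class OcrSettings:
    """Per-user recognition settings (hypothetical structure)."""
    language: str = "eng"    # assumed OCR.space code for English
    is_table: bool = False   # False = plain text, True = table layout
    output: str = "message"  # "message" or "file"


# chat_id -> settings; kept in memory only, so it is lost on restart
_user_settings: Dict[int, OcrSettings] = {}


def get_settings(chat_id: int) -> OcrSettings:
    """Return the stored settings for a chat, or the defaults."""
    return _user_settings.get(chat_id, OcrSettings())


def update_settings(chat_id: int, **changes) -> OcrSettings:
    """Apply changes (e.g. from an inline menu button) and store them."""
    updated = replace(get_settings(chat_id), **changes)
    if updated.output not in ("message", "file"):
        raise ValueError("output must be 'message' or 'file'")
    _user_settings[chat_id] = updated
    return updated
```

A button handler would then call something like `update_settings(chat_id, output="file")` when the user picks the text-file option.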

As soon as the bot receives a valid file from the user, it sends the file to the OCR.space API (https://ocr.space/), which returns the recognition result in JSON format. This result is then converted into a message that is sent to the user through the Telegram API.
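The JSON handling step could look roughly like the sketch below. It assumes the publicly documented OCR.space conventions (endpoint `https://api.ocr.space/parse/image`, form fields `apikey`, `language`, `isTable`, and response keys `ParsedResults`, `ParsedText`, `IsErroredOnProcessing`, `ErrorMessage`); check these against the current OCR.space documentation before relying on them. The helper names `build_payload` and `extract_text` are hypothetical, and the HTTP upload itself is left out.

```python
OCR_ENDPOINT = "https://api.ocr.space/parse/image"  # assumed endpoint


def build_payload(api_key: str, language: str = "eng", is_table: bool = False) -> dict:
    """Form fields to send alongside the uploaded file (assumed names)."""
    return {
        "apikey": api_key,
        "language": language,
        "isTable": "true" if is_table else "false",
    }


def extract_text(response_json: dict) -> str:
    """Turn the decoded JSON response into plain text, one block per page."""
    if response_json.get("IsErroredOnProcessing"):
        message = response_json.get("ErrorMessage") or "unknown OCR error"
        if isinstance(message, list):  # the API may return a list of messages
            message = "; ".join(str(m) for m in message)
        raise RuntimeError(str(message))
    pages = response_json.get("ParsedResults") or []
    return "\n".join(page.get("ParsedText", "").strip() for page in pages)
```

With an HTTP client of your choice, the payload from `build_payload` would be posted to `OCR_ENDPOINT` together with the file, and the decoded response passed to `extract_text`.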

Current limitations of the free OCR API service:

Supported file formats: PDF, PNG, JPG (JPEG), BMP, TIF (TIFF), GIF
Maximum file size: 1 MB; maximum number of pages in a PDF file: 3
Request limit: 500 requests per day
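A pre-upload check against the limits listed above might be sketched like this. The function `check_upload` and the constants are hypothetical, the 1 MB limit is assumed to mean 1024 × 1024 bytes, and the page count is passed in by the caller because counting PDF pages needs a separate library.

```python
import os
from typing import Optional

ALLOWED_EXTENSIONS = {".pdf", ".png", ".jpg", ".jpeg", ".bmp", ".tif", ".tiff", ".gif"}
MAX_FILE_SIZE = 1024 * 1024  # 1 MB, assumed to be a binary megabyte
MAX_PDF_PAGES = 3


def check_upload(filename: str, size_bytes: int, pdf_pages: Optional[int] = None) -> Optional[str]:
    """Return an error message for the user, or None if the file is acceptable."""
    ext = os.path.splitext(filename)[1].lower()
    if ext not in ALLOWED_EXTENSIONS:
        return "Unsupported file format."
    if size_bytes > MAX_FILE_SIZE:
        return "The file is larger than 1 MB."
    # The page limit applies to PDFs only, and only if the count is known.
    if ext == ".pdf" and pdf_pages is not None and pdf_pages > MAX_PDF_PAGES:
        return "The PDF has more than 3 pages."
    return None
```

Rejecting a file before it reaches the OCR service also avoids spending one of the 500 daily requests on an upload that would fail anyway.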

The bot uses the Telethon Python library to interact with the Telegram API.
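For readers new to Telethon, a bare-bones bot that answers "/start" and accepts a file could be sketched as below. This is an illustration under assumptions, not the code in `bot.py`: the environment variable names, the session name `ocr_bot`, the helper `load_credentials`, and the reply texts are all made up here. Telethon is imported inside `main()` so the helper can be used without the library installed.

```python
import os
from typing import Mapping, Tuple

WELCOME_TEXT = "Send me an image or PDF and I will return the recognized text."


def load_credentials(env: Mapping[str, str]) -> Tuple[int, str, str]:
    """Read API credentials from a mapping such as os.environ (assumed variable names)."""
    return int(env["TG_API_ID"]), env["TG_API_HASH"], env["TG_BOT_TOKEN"]


def main() -> None:
    # Imported lazily so this module loads even where Telethon is absent.
    from telethon import TelegramClient, events

    api_id, api_hash, bot_token = load_credentials(os.environ)
    client = TelegramClient("ocr_bot", api_id, api_hash).start(bot_token=bot_token)

    @client.on(events.NewMessage(pattern=r"^/start"))
    async def on_start(event):
        await event.respond(WELCOME_TEXT)

    @client.on(events.NewMessage(func=lambda e: e.message.file is not None))
    async def on_file(event):
        # Download the upload into memory; the OCR call is omitted in this sketch.
        data = await event.message.download_media(file=bytes)
        await event.respond(f"Received {len(data)} bytes.")

    client.run_until_disconnected()
```

Calling `main()` with the three environment variables set starts the bot and keeps it running until the connection is closed.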

The entire bot is written in Python 3.7.

🎈 Usage

To use the bot, type:

/start

You can change the text recognition settings: text language (24 languages are supported), content format (plain text or table), and recognition result (message or text file). These settings are changed using the inline menu buttons. You can open the settings menu at any time by calling the command:

/settings

Please note: the bot can be slow at times because it depends on the response time of the OCR.space API.

⛏️ Roadmap

Add the ability to process files by URL
~~Add a warning when the page limit of a PDF file is exceeded~~ DONE
Add the ability to process text over 4096 characters
Anti-flood protection
Refactoring
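One possible approach to the roadmap item on text over 4096 characters (the maximum length of a single Telegram message) is to split the recognized text into several messages. The sketch below is only a suggestion, and the function `split_message` is a hypothetical name. It uses Python string length as the measure, which is assumed to be a close enough proxy for how Telegram counts characters.

```python
from typing import List

TELEGRAM_MESSAGE_LIMIT = 4096


def split_message(text: str, limit: int = TELEGRAM_MESSAGE_LIMIT) -> List[str]:
    """Split text into chunks of at most `limit` characters, preferring line breaks."""
    chunks = []
    remaining = text
    while len(remaining) > limit:
        # Look for the last newline that keeps the chunk within the limit.
        cut = remaining.rfind("\n", 0, limit + 1)
        if cut <= 0:
            cut = limit  # no usable newline: cut mid-line
        chunks.append(remaining[:cut])
        # Newlines at the cut point are dropped rather than starting the next chunk.
        remaining = remaining[cut:].lstrip("\n")
    if remaining:
        chunks.append(remaining)
    return chunks
```

Each chunk would then be sent as its own message, in order. Sending the result as a text file, which the bot already supports, remains the simpler option for very long documents.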

⛏️ Built Using

Telethon - Telethon is an asyncio Python 3 MTProto library to interact with Telegram's API as a user or through a bot account (bot API alternative).
ocr.space - Free Online OCR - Convert images and PDF to text
Logging - Logging library for debugging

✍️ Authors

Alexey Tasbauov

📗 License

This project is licensed under the MIT License - see the LICENSE file for more details.

🎉 Acknowledgements

Thank you to Telethon for providing the Python wrapper!

