Telegram Bot for Automated Report Processing

A smart Telegram bot for automated processing of field crew reports using AI-powered photo analysis and file organization.

Features

24/7 Autonomous Mode — runs continuously and processes new reports automatically
Smart Timer — waits for the crew to finish uploading all files (configurable timeout)
Smart Filtering — processes only UA/UB topics, ignores administrative and TSS/TSSR ones
Two-Stage AI Analysis — initial photo classification + outlier re-analysis using series context
VDO Support — automatic detection of VDO topics and equipment list merging with the main topic
Crash Recovery — restores unfinished tasks from the database on restart
Producer-Consumer Architecture — parallel processing of multiple topics via a task queue
Monitoring — healthcheck system with periodic status reports to Telegram

Installation

Install dependencies:

pip install -r requirements.txt

Copy .env.example to .env and fill in the environment variables:

Variable	Required	Description	Default
`TELEGRAM_API_ID`	yes	API ID from my.telegram.org	—
`TELEGRAM_API_HASH`	yes	API Hash from Telegram	—
`TELEGRAM_PHONE_NUMBER`	yes	Phone number for authentication	—
`TARGET_CHAT_ID`	yes	Target supergroup ID	—
`GEMINI_API_KEY`	yes	API key from Google AI Studio	—
`WAIT_TIME`	no	Upload wait time (seconds)	`300`
`MAX_WORKERS`	no	Number of parallel workers (1–8)	`2`
`GEMINI_MODEL`	no	Gemini model for analysis	`gemini-flash-latest`
`DB_PATH`	no	Path to SQLite database	`bot_memory.db`
`HEALTHCHECK_INTERVAL`	no	Telegram report interval (seconds)	`3600`
`DOWNLOADS_DIR`	no	Temporary downloads directory	`downloads`
`SESSION_FILE`	no	Telethon session file	`session_name.session`

Running

Docker (recommended for production)

# Build and start in the background
docker-compose up -d --build

# View logs
docker logs -f work_auto_bot

# Stop
docker-compose down

On the first run you may need to start the container interactively (docker-compose run -it bot) to enter the Telegram verification code or 2FA password. The session is persisted for subsequent runs.

Docker container includes:

Health checks every 60 seconds
Memory limit 1 GB, CPU limit 1.0
Automatic restart (unless-stopped)
Volume mounts for data persistence
Timezone Europe/Kiev

Local

python main.py

Systemd Service (Linux)

A ready-to-use work_auto.service file is provided for systemd deployment.

How It Works

The bot continuously listens for messages in UA/UB topics
When new activity is detected, it starts a timer (WAIT_TIME)
If new files arrive — the timer resets
Once the timer expires, the bot downloads everything and starts AI analysis
Stage 1 — classify each photo (dismantling / installation / other)
Stage 2 — detect outliers in the series and re-analyze them using neighboring photo context
Creates a ZIP archive with sorted files and a README report
For VDO topics — automatically finds the paired topic and merges data

Topic Filtering

Processes topics with "UA" or "UB" in the title:

UA4571 Демонтаж/Монтаж
UB9999 Оборудование

Ignores:

Topics with "TSS" or "TSSR" in the title
Administrative topics ("Общие", "Топливо", "План", etc.)

Output

The bot automatically:

Downloads all photos and text messages
Extracts equipment lists from text via AI
Analyzes photos through Google Gemini (two-stage process)
Sorts files into folders:
- Demontaj/ — dismantled equipment
- Montaj/ — installed equipment
- Other/ — miscellaneous photos
Generates a README.txt with a detailed report
Packages everything into a ZIP archive named by site ID

Output File Structure

output/
├── UA4571_MAIN.zip
├── UA4571_VDO.zip          # if a VDO topic exists
downloads/
├── topic_12345/            # temporary folder (deleted after processing)
│   ├── messages_log.txt
│   ├── README.txt
│   ├── Demontaj/
│   │   ├── Anten_T1003M6R011_photo_123.jpg
│   │   └── DCDU12B_photo_124.jpg
│   └── Montaj/
│       └── RRU_5516_photo_125.jpg

Project Structure

├── main.py                     # entry point (autonomous mode)
├── requirements.txt            # production dependencies
├── requirements-dev.txt        # development dependencies
├── Dockerfile                  # Docker image (Python 3.11-slim)
├── docker-compose.yml          # Docker Compose configuration
├── pytest.ini                  # test configuration
├── work_auto.service           # systemd service
├── check_models.py             # utility to check available Gemini models
├── modules/
│   ├── telegram_client.py      # Telegram client with event-driven architecture
│   ├── ai_processor.py         # two-stage AI photo analysis
│   ├── task_queue.py           # task queue (Producer-Consumer)
│   ├── database.py             # SQLite state persistence
│   └── utils.py                # utilities (ZIP, README, site ID extraction)
├── tests/
│   ├── test_ai_processor.py
│   ├── test_database.py
│   ├── test_error_scenarios.py
│   ├── test_integration_pipeline.py
│   ├── test_telegram_logic.py
│   └── test_utils.py
├── .github/
│   └── workflows/
│       └── ci.yml              # CI/CD: ruff + bandit + pytest
├── logs/                       # rotating logs (5 MB x 5 files)
├── downloads/                  # temporary processing files
└── output/                     # final ZIP archives

Architecture

Telegram NewMessage Event
        ↓
   Topic Timer (resets on new messages)
        ↓
   Task Queue (FIFO)
        ↓
   Worker Pool (2–8 parallel workers)
        ↓
   Pipeline: Download → AI Analysis → Packaging → Delivery
        ↓
   SQLite (state persistence)

Event-driven — reacts to new messages in real time
Producer-Consumer — efficient processing via a task queue
Async/Thread hybrid — async for Telegram, ThreadPool for AI and file I/O
Persistence — SQLite (WAL mode) stores processing state and enables crash recovery
Caching — LRU cache for topic titles (500 entries, 1-hour TTL)

Development

Install dev dependencies

pip install -r requirements-dev.txt

Run tests

pytest

CI/CD

GitHub Actions runs automatically on push to main and on pull requests:

Lint — ruff check (code style)
Security — bandit (vulnerability scanning)
Tests — pytest (unit and integration tests)

Logging and Monitoring

Log file: logs/bot.log (rotation: 5 MB x 5 files)
Healthcheck: file updated every 30 seconds (Docker checks every 60 seconds)
Telegram reports: periodic messages with metrics (processed, errors, timing)

Important Notes

Make sure the bot has read access to messages in the target group
A valid GEMINI_API_KEY is required for AI analysis
First-time authentication may require entering a Telegram verification code
The session is persisted in SESSION_FILE
The database (bot_memory.db) stores processing progress — do not delete it unnecessarily

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Telegram Bot for Automated Report Processing

Features

Installation

Running

Docker (recommended for production)

Local

Systemd Service (Linux)

How It Works

Topic Filtering

Output

Output File Structure

Project Structure

Architecture

Development

Install dev dependencies

Run tests

CI/CD

Logging and Monitoring

Important Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
.github/workflows		.github/workflows
modules		modules
tests		tests
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
__init__.py		__init__.py
docker-compose.yml		docker-compose.yml
healthcheck		healthcheck
main.py		main.py
pytest.ini		pytest.ini
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt
work_auto.service		work_auto.service

Folders and files

Latest commit

History

Repository files navigation

Telegram Bot for Automated Report Processing

Features

Installation

Running

Docker (recommended for production)

Local

Systemd Service (Linux)

How It Works

Topic Filtering

Output

Output File Structure

Project Structure

Architecture

Development

Install dev dependencies

Run tests

CI/CD

Logging and Monitoring

Important Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages