Art-guide

A telegram bot that can be your audio guide at Art Museums. Send a photo of an artwork, and it will generate an audio description to which you can listen to.

Try it out: https://t.me/harbour_art_guide_bot

We developed this project during the class in Harbour.Space.

How it works?

Read a HackMD article about the project

Modules:

Image Cropper

This module is in charge of segmenting the image and cropping the relevant part of the image (for e.g. the painting in the image).

You can go here for more details about the image crop module or read this blog post.

Reverse Image Search

This module is in charge of finding the most similar image to the input image in the database and return the relevant information about the image.

You can go here for more details about the reverse image search module or read this blog post.

Description Generation

This module receives the relevant information of the found image and generates a description of the image, in a specific format. You can go here for more details about the descriptor module or read this blog post.

Text to Speech

This module receives the description and generates an audio file of the description. You can go here for more details about the text to speech module.

Bot Flow

The bot receives the image from the user.
The bot sends the image to the image cropper module.
The bot sends the cropped image to the reverse image search module.
The reverse image search module finds the most similar image to the input image in the database, and returns it back to the bot with the relevant information.
The bot evaluates if the found image is similar enough to the input image. In case it isn't, the bot tells the user that it couldn't find the image. In case it is, the bot sends the relevant information of the found image to the description generation module.
The bot sends the description to the text to speech module.
The bot sends the audio file to the user.

CI/CD

This project uses GitHub Actions for CI/CD. The workflow is defined in .github/workflows/main.yml. The workflow is triggered on every push or merge request to the main branch. The workflow consists mainly in a build job that builds the docker image and pushes it to the GitHub Container Registry. For more information please read CI/CD README

Setting up the project

There are 3 ways to run the project:

Run the project directly running the python file.
Build the Docker image and run it.
Run the docker image from the GitHub Container Registry.

Environment

For the first 2 options you need to set up the environment variables

Create the environment file (on a .env file by using .env.example as template).
Set your own keys (telegram token).

Run the project directly running the python file

You need 2 concurrent terminals to run the project.

Terminal 1 (Rest API)

gunicorn -w 4 -k uvicorn.workers.UvicornWorker main:app

Terminal 2 (Telegram Bot)

python3 main.py

Build and run the Docker image

docker build -t art-guide-tg-bot .
docker run art-guide-tg-bot

Run the docker image from the GitHub Container Registry

docker pull arielxx/art-guide-tg-bot:latest
docker run arielxx/art-guide-tg-bot:latest

How to use the modules

from descriptor_module import descriptor
from image_crop_module import croper

...

image = Image.open(path)

# input: Pilow image, output: np.array
image_croped = croper.crop_image(image)

...
description = descriptor.describe(features_dict)

print(description)

Name		Name	Last commit message	Last commit date
Latest commit History 83 Commits
.dvc		.dvc
.github/workflows		.github/workflows
data		data
descriptor_module		descriptor_module
image_crop_module		image_crop_module
logs		logs
reverse_image_search_module		reverse_image_search_module
tests		tests
text2speech_module		text2speech_module
.dvcignore		.dvcignore
.env.example		.env.example
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
bot.py		bot.py
main.py		main.py
main_web.py		main_web.py
mocker.py		mocker.py
pytest.ini		pytest.ini
requirements.txt		requirements.txt
run_ml_backend.sh		run_ml_backend.sh
services.py		services.py
supervisord.conf		supervisord.conf
test.py		test.py
utils.py		utils.py

License

aguschin/art-guide

Folders and files

Latest commit

History

Repository files navigation