Nexus AI: Multimodal Semantic Search Engine

An internship project built for the Endee AI Challenge. This application performs high-accuracy semantic image retrieval through a Telegram interface.

🏗️ System Design (Adaptive Retrieval Architecture)

My implementation follows a production-style AI pipeline designed for low-latency search:

Preprocessing Layer: Images stored in /pics are loaded via Pillow and normalized for the AI model.
Embedding Engine: Uses the CLIP (ViT-B-32) transformer model. This is a multimodal model that maps natural-language text and images into a shared embedding space, converting both into 512-dimensional vectors that can be compared directly.
Local Vector Database: To ensure portability and overcome environment constraints with Docker, I developed a custom JSON-based Vector Store (vector_db.json). This stores the file paths and their corresponding high-dimensional embeddings.
Search & Similarity Logic:

The system calculates the Cosine Similarity between the user's text query and all stored image vectors.
Adaptive Filtering: I implemented a custom "Smart Gate." If the top match is significantly stronger than the rest, only 1 result is sent. If multiple images are highly relevant (within a 90% similarity threshold), the bot adaptively returns the top 2.
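The "Smart Gate" rule above can be sketched as follows. This is a minimal illustration, not the code in bot.py: the function name smart_gate, the (path, score) input shape, and the reading of the 90% threshold as "the runner-up scores at least 90% of the top score" are all assumptions made for the example.

```python
from typing import List, Tuple


def smart_gate(scored: List[Tuple[str, float]], ratio: float = 0.90) -> List[str]:
    """Return one or two image paths, depending on how close the top two scores are.

    `scored` holds (path, cosine_similarity) pairs in any order.
    `ratio` is an assumed interpretation of the 90% threshold: the runner-up
    is kept only if its score is at least `ratio` times the best score.
    """
    if not scored:
        return []

    # Highest similarity first.
    ranked = sorted(scored, key=lambda item: item[1], reverse=True)
    best_path, best_score = ranked[0]

    # Send a second image only when it is nearly as relevant as the first.
    if len(ranked) > 1 and best_score > 0 and ranked[1][1] >= ratio * best_score:
        return [best_path, ranked[1][0]]

    # Otherwise the top match clearly dominates, so send it alone.
    return [best_path]
```

With scores of 0.30 and 0.29 the gate returns both images, while 0.30 and 0.20 returns only the first. A relative ratio is used here rather than an absolute cutoff because raw CLIP similarities tend to sit in a narrow range; the real implementation may choose differently.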

🛠️ How I Built It

Semantic Search: Unlike keyword search, this bot matches on concepts. Searching "mammal" can return a picture of a bear even though the word "mammal" does not appear in the filename.
Python-Telegram-Bot: A clean, real-world interface for user interaction.
NumPy Math: Used for fast vector normalization and dot-product calculations.
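The normalization and dot-product step can be sketched with NumPy as below. The function name cosine_scores and the array shapes are assumptions for illustration; the idea is that once every vector is scaled to unit length, a single matrix-vector product yields the cosine similarity against all stored images at once.

```python
import numpy as np


def cosine_scores(query_vec, image_vecs) -> np.ndarray:
    """Cosine similarity between one query vector and every row of image_vecs.

    query_vec:  shape (d,)   e.g. the text embedding
    image_vecs: shape (n, d) e.g. the stored image embeddings
    Assumes no vector is all zeros (its norm would be 0).
    """
    q = np.asarray(query_vec, dtype=np.float64)
    m = np.asarray(image_vecs, dtype=np.float64)

    # Scale everything to unit length so the dot product equals the cosine.
    q = q / np.linalg.norm(q)
    m = m / np.linalg.norm(m, axis=1, keepdims=True)

    # One matrix-vector product gives all n similarities.
    return m @ q


if __name__ == "__main__":
    scores = cosine_scores([1.0, 0.0], [[2.0, 0.0], [0.0, 3.0], [1.0, 1.0]])
    best = int(np.argmax(scores))
    print(best, scores)
```

If the stored embeddings are normalized once at indexing time, only the query needs normalizing per search.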

🚀 Setup Instructions

1. Install Dependencies

Run the following command to install the required AI libraries:

pip install sentence-transformers pillow python-telegram-bot numpy

2. Index Your Images

Ensure your images are in the /pics folder, then run the engine to generate your vector database:

python engine.py
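For readers curious what the indexing step produces, here is a minimal sketch of a JSON vector store. It is not the actual engine.py: the function names build_index and load_index, the record keys "path" and "embedding", and the list of accepted file extensions are assumptions. The embedding function is passed in as a parameter so the sketch stays independent of the model; with sentence-transformers it would typically wrap something like SentenceTransformer("clip-ViT-B-32").encode(Image.open(path)).

```python
import json
from pathlib import Path
from typing import Callable, Dict, List, Sequence

# Assumed set of image extensions to index.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def build_index(pics_dir, db_path, embed: Callable[[Path], Sequence[float]]) -> int:
    """Embed every image in pics_dir and write the results to a JSON file.

    `embed` is any callable that turns an image path into a vector.
    Returns the number of images indexed.
    """
    records: List[Dict] = []
    for path in sorted(Path(pics_dir).iterdir()):
        if path.suffix.lower() not in IMAGE_SUFFIXES:
            continue  # skip non-image files
        vector = [float(x) for x in embed(path)]  # JSON needs plain floats
        records.append({"path": str(path), "embedding": vector})

    Path(db_path).write_text(json.dumps(records))
    return len(records)


def load_index(db_path) -> List[Dict]:
    """Read the stored records back as a list of dicts."""
    return json.loads(Path(db_path).read_text())
```

A flat JSON file keeps the project portable and easy to inspect, at the cost of loading every vector into memory and scanning all of them on each search, which is reasonable for a small image folder.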

3. Start the Bot

Update the token in bot.py and run the main service to go live:

python bot.py

📝 Project Evolution Note

This project was originally designed for the Endee Docker environment. During the 24-hour challenge I pivoted to a Local Vector Engine implementation so that the full pipeline runs without Docker and is easier for the technical team to evaluate.
