A modular Retrieval-Augmented Generation (RAG) pipeline for document search, chunking, embedding, and conversational AI.
- Document Ingestion: Flexible ingestion for markdown, text, and more.
- Chunking: Smart document chunking for optimal retrieval.
- Embeddings: Integrates with state-of-the-art embedding models.
- Vector Search: Fast, scalable semantic search using ChromaDB.
- Conversational Engine: Chat interface for natural language queries.
- Extensible: Modular core for easy customization and extension.
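To give a feel for the chunking step, here is a minimal sketch of fixed-size chunking with character overlap. It is illustrative only: the actual logic lives in core/document_processor.py and may use smarter, structure-aware splitting; the function name and defaults here are assumptions.

```python
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into overlapping character windows.

    Overlap keeps context that straddles a chunk boundary retrievable
    from both neighboring chunks.
    """
    if chunk_size <= overlap:
        raise ValueError("chunk_size must exceed overlap")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunk = text[start:start + chunk_size]
        if chunk:
            chunks.append(chunk)
    return chunks
```

For a 1000-character document with chunk_size=400 and overlap=100, this yields windows starting at offsets 0, 300, 600, and 900.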
ai-tech-rag/
├── assets/ # Visuals, custom commands, and static assets
├── chroma_db/ # ChromaDB vector database files
├── qwerty/ # Python virtual environment
├── rag-assistant/ # Main app and core modules
│ ├── app.py # Flask app entry point
│ ├── core/ # Core RAG logic (chat, vector, document)
│ ├── static/ # Frontend static files (CSS, JS)
│ └── templates/ # Jinja2 HTML templates
├── techcorp-docs/ # Example documents for ingestion
├── test_*.py # Unit and integration tests
├── requirements.txt # Python dependencies
└── README # This file
# 1. Clone the repo
git clone https://github.com/MansurPro/RAG-assistant.git
cd ai-tech-rag
# 2. Create and activate a virtual environment (optional)
python3 -m venv qwerty
source qwerty/bin/activate
# 3. Install dependencies
pip install -r requirements.txt
# 4. Run the app
python rag-assistant/app.py
- Ingest Documents:
  - Place your files in techcorp-docs/ or use ingest_documents.py.
- Search & Chat:
- Use the web UI at http://localhost:5000 to chat and search.
- Test the Pipeline:
  - Run pytest or use the provided test scripts.
- core/document_processor.py – Document loading, chunking, and preprocessing
- core/vector_engine.py – Embedding and vector search logic
- core/chat_engine.py – Conversational AI and retrieval logic
- app.py – Flask app and API endpoints
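To illustrate the retrieval step that core/vector_engine.py is responsible for, here is a stdlib-only sketch of ranking stored chunk embeddings by cosine similarity to a query embedding. In the real pipeline this is delegated to ChromaDB; the function names and index shape here are hypothetical.

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def top_k(query: list[float], index: dict[str, list[float]], k: int = 3) -> list[str]:
    """Return the ids of the k chunks most similar to the query."""
    ranked = sorted(index, key=lambda cid: cosine_similarity(query, index[cid]),
                    reverse=True)
    return ranked[:k]
```

The retrieved chunk ids map back to document text, which the chat engine then feeds to the language model as context.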
pytest
# or run individual test files, e.g.
python test_chunking.py

Pull requests and issues are welcome! For major changes, please open an issue first to discuss what you would like to change.
MIT License. See LICENSE for details.

