RAG Chatbot

A production-grade Retrieval-Augmented Generation (RAG) chatbot built with LangChain, FAISS, Groq, and Streamlit. Upload documents or scrape websites and chat with your knowledge base using free LLMs.

Features

Document Loaders — PDF (PyPDF2), DOCX, TXT, Markdown
Smart Chunking — Recursive, Character, Token-based strategies
Free Embeddings — sentence-transformers/all-MiniLM-L6-v2 (runs locally)
Hybrid Search — FAISS dense + BM25 sparse with Reciprocal Rank Fusion
Free LLMs — Groq (llama-3.1, mixtral, gemma) + HuggingFace
RAG Evaluation — RAGAS framework (Faithfulness, Answer Relevancy, Context Precision)
Web Scraping — BeautifulSoup + Selenium fallback
Multilingual — Auto-detect + translate (13 languages)
Export Chat — TXT, CSV, JSON, PDF
Conversation Memory — Sliding window history with standalone query rewriting

Quick Start

1. Clone the repository

git clone https://github.com/ronitgulia/RAG-Chatbot
cd RAG-Chatbot

2. Install dependencies

pip install -r requirements.txt

3. Get a free Groq API key

Sign up at console.groq.com — it's completely free.

4. Run the app

streamlit run app.py

5. Usage

Paste your Groq API key in the sidebar → click Connect LLM
Go to Documents tab → upload files or scrape URLs
Go to Chat tab → ask questions
View Evaluation tab for quality metrics
Export your conversation in any format

Project Structure

RagChatBot/
├── app.py                  # Streamlit UI (Chat, Documents, Evaluation, Export)
├── rag_pipeline.py         # Core orchestrator
├── document_loader.py      # PDF, DOCX, TXT loaders
├── text_chunker.py         # LangChain text splitters
├── embeddings.py           # Sentence-transformers wrapper
├── vector_store.py         # FAISS + BM25 hybrid store
├── llm_provider.py         # Groq + HuggingFace providers
├── evaluation.py           # RAGAS evaluation pipeline
├── web_scraper.py          # Web scraping module
├── multilingual.py         # Language detection & translation
├── export_utils.py         # Chat export utilities
├── styles.py               # Custom Streamlit CSS
├── config.py               # Centralized configuration
├── requirements.txt        # Dependencies
└── .env.example            # Environment variable template

Evaluation Metrics

Metric	Description
Faithfulness	Is the answer grounded in retrieved context?
Answer Relevancy	Does the answer address the question?
Context Precision	Are retrieved chunks relevant to the query?

Available LLM Models (Groq — Free)

Model	Context	Best For
`llama-3.1-8b-instant`	128k	Fast responses (default)
`llama-3.3-70b-versatile`	128k	Best quality
`deepseek-r1-distill-llama-70b`	128k	Reasoning tasks
`gemma2-9b-it`	8k	Balanced
`mixtral-8x7b-32768`	32k	Long documents

License

MIT License

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAG Chatbot

Features

Quick Start

1. Clone the repository

2. Install dependencies

3. Get a free Groq API key

4. Run the app

5. Usage

Project Structure

Evaluation Metrics

Available LLM Models (Groq — Free)

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.devcontainer		.devcontainer
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
app.py		app.py
config.py		config.py
document_loader.py		document_loader.py
embeddings.py		embeddings.py
evaluation.py		evaluation.py
export_utils.py		export_utils.py
llm_provider.py		llm_provider.py
multilingual.py		multilingual.py
rag_pipeline.py		rag_pipeline.py
requirements.txt		requirements.txt
styles.py		styles.py
test_core.py		test_core.py
text_chunker.py		text_chunker.py
vector_store.py		vector_store.py
web_scraper.py		web_scraper.py

Folders and files

Latest commit

History

Repository files navigation

RAG Chatbot

Features

Quick Start

1. Clone the repository

2. Install dependencies

3. Get a free Groq API key

4. Run the app

5. Usage

Project Structure

Evaluation Metrics

Available LLM Models (Groq — Free)

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages