Agentic RAG Assistant

This project is a retrieval-augmented generation (RAG) AI assistant designed to answer questions strictly based on a provided set of documents. It uses LangChain for orchestration and Groq's LLM for generation.

Overview

The assistant lets users query a knowledge base covering Artificial Intelligence, Biotechnology, Climate Science, Quantum Computing, Space Exploration, and Sustainable Energy. It is designed as an "agentic" system: it decides when to retrieve information and explicitly refuses to answer when the information is not present in its knowledge base.

Architecture

Document Loading: Text documents are loaded and split into chunks.
Embeddings & Vector Store: Chunks are embedded using sentence-transformers/all-MiniLM-L6-v2 and stored in a FAISS vector database.
Retrieval: A retrieval mechanism fetches relevant chunks based on semantic similarity. A strict similarity threshold is enforced to filter out irrelevant information.
Agent/Chain: A LangChain pipeline processes the user query and retrieved context.
LLM: Groq (llama-3.1-8b-instant) generates the response based only on the context.
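
The first three stages above can be illustrated with a minimal, dependency-free sketch. It is not the project's code: `split_into_chunks`, `embed`, `cosine`, and `rank_chunks` are hypothetical names, the chunk size and overlap are arbitrary, and a toy word-count vector stands in for the all-MiniLM-L6-v2 embeddings and the FAISS index.

```python
import math
from collections import Counter


def split_into_chunks(text: str, chunk_size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into fixed-size character windows that overlap."""
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be non-negative and smaller than chunk_size")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the last window already reaches the end of the text
    return chunks


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model (assumption): lowercase word counts."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def rank_chunks(query: str, chunks: list[str], k: int = 3) -> list[tuple[float, str]]:
    """Return the k chunks most similar to the query as (score, chunk) pairs."""
    query_vec = embed(query)
    scored = [(cosine(query_vec, embed(chunk)), chunk) for chunk in chunks]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)[:k]
```

In the real pipeline a dense embedding model captures meaning rather than exact word overlap, so paraphrased questions can still match relevant chunks; the ranking step has the same shape either way.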

Anti-Hallucination Measures

Retrieval-Only Answering: The system prompt explicitly forbids using external knowledge.
Similarity Threshold: If the retrieved documents do not match the query closely enough (based on a score threshold), they are withheld from the context, so the LLM has nothing to draw on and returns its refusal response.
Strict System Prompt: The LLM is instructed to say "I do not have enough information" if the context is insufficient.
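
A rough sketch of how the threshold gate and the strict prompt could fit together is shown below. Several things here are assumptions rather than the project's actual code: the names `filter_by_score`, `build_prompt`, and `answer`, the threshold value of 0.5, the prompt wording, and the convention that a higher score means a closer match (some vector stores return distances, where lower is better). The sketch also returns the refusal directly when nothing passes the gate, instead of calling the model with an empty context; that is one possible variant.

```python
REFUSAL = "I do not have enough information"

# Hypothetical prompt wording; the project's real system prompt is not shown in this README.
SYSTEM_PROMPT = (
    "Answer using only the context below. Do not use outside knowledge. "
    f'If the context is insufficient, reply exactly: "{REFUSAL}".'
)


def filter_by_score(scored_chunks, threshold=0.5):
    """Keep chunks whose similarity score meets the threshold (higher = closer)."""
    return [chunk for score, chunk in scored_chunks if score >= threshold]


def build_prompt(question, scored_chunks, threshold=0.5):
    """Return the full prompt, or None when no chunk passes the gate."""
    kept = filter_by_score(scored_chunks, threshold)
    if not kept:
        return None
    context = "\n\n".join(kept)
    return f"{SYSTEM_PROMPT}\n\nContext:\n{context}\n\nQuestion: {question}"


def answer(question, scored_chunks, llm, threshold=0.5):
    """Call llm (any callable taking a prompt string) only when context survives the gate."""
    prompt = build_prompt(question, scored_chunks, threshold)
    if prompt is None:
        return REFUSAL  # nothing relevant was retrieved, so skip the model call
    return llm(prompt)
```

Here `llm` is any function that maps a prompt string to a response string, which keeps the gating logic testable without network access.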

How to Run Locally

Prerequisites

Python 3.9+
A Groq API Key

Installation

Clone the repository.
Install dependencies:
```
pip install -r requirements.txt
```
Create a .env file in the root directory and add your Groq API key:
```
GROQ_API_KEY=your_actual_api_key_here
```
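
How the app reads this file is not shown here; a common approach is to load .env into the process environment (for example with the python-dotenv package) and then read `GROQ_API_KEY`. The sketch below covers only the second half, using the standard library, and fails early with a clear message if the key is missing or still set to the placeholder. The function name `require_groq_api_key` is hypothetical.

```python
import os


def require_groq_api_key(env=None):
    """Return the Groq API key from the environment, or raise a clear error."""
    env = os.environ if env is None else env
    key = env.get("GROQ_API_KEY", "").strip()
    if not key or key == "your_actual_api_key_here":
        raise RuntimeError(
            "GROQ_API_KEY is not set. Add it to the .env file in the project root."
        )
    return key
```

Checking at startup surfaces a missing key immediately rather than on the first query.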

Running the App

Initialize the Knowledge Base: The first time you run the app, it will create the vector store from the documents in data/.
Start the UI:
```
streamlit run ui/app.py
```
Or use the helper script:
```
python main.py
```
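
The "build on first run" behaviour described above is a load-or-build pattern. The sketch below shows the idea with a JSON file standing in for the saved vector store; the name `load_or_build`, the file format, and the path are assumptions, and the actual app presumably persists a FAISS index instead.

```python
import json
from pathlib import Path


def load_or_build(index_path, build_fn):
    """Load a saved index if it exists; otherwise build it once and save it."""
    path = Path(index_path)
    if path.exists():
        return json.loads(path.read_text(encoding="utf-8"))
    index = build_fn()  # expensive step: load, chunk, and embed the documents
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(index), encoding="utf-8")
    return index
```

With this pattern, deleting the saved file forces a rebuild, which is useful after the documents in data/ change.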

Example Queries

In-Scope: "What is strong AI?", "How does quantum computing work?", "Explain the types of sustainable energy."
Out-of-Scope: "Who is the president of the USA?", "What is the capital of Australia?" (will be refused).

Limitations

The knowledge is limited strictly to the provided text files.
It assumes the provided documents are the sole source of truth.

Future Improvements

Add support for PDF and other file formats.
Implement conversation history memory (Multi-turn RAG).
Add citation sources to the output.

dharamshiyash/agentic-rag-assistant
