Skip to content

VivekArgSharma/AutoRag

Repository files navigation

Autonomous RAG Pipeline Builder & Optimizer

AutoRag is an intelligent framework that automates the creation, evaluation, and optimization of Retrieval-Augmented Generation (RAG) systems.

Instead of manually tuning embeddings, chunking, retrievers, and prompts — AutoRag experiments, evaluates, and finds the best pipeline for your data automatically.

🧠 Why AutoRag?

Building a good RAG system is hard:

Which embedding model works best?

What chunk size should you use?

Which retriever + reranker combination is optimal?

How do you evaluate performance objectively?

AutoRag solves this by:

⚙️ Automatically testing multiple pipeline configurations

📊 Evaluating performance using custom datasets

🧩 Selecting the best-performing architecture

🚀 Deploying optimized pipelines instantly

✨ Features

🔄 Automated RAG Pipeline Search

Tries multiple combinations of:

Embeddings

Chunking strategies

Retrievers

Prompt templates

📊 Evaluation-Driven Optimization

Uses QA datasets to score pipelines

Selects best pipeline based on metrics

🧱 Modular Architecture

Easily plug in:

Custom LLMs

Local models

APIs (OpenAI, etc.)

⚡ End-to-End Workflow

Data → Chunking → Retrieval → Generation → Evaluation → Deployment

🖥️ Interactive Execution

Raw Documents ↓ Parsing & Cleaning ↓ Chunking Strategies ↓ Embedding + Indexing ↓ Retrieval + Reranking ↓ LLM Generation ↓ Evaluation Engine ↓ Best Pipeline Selection

git clone https://github.com/VivekArgSharma/AutoRag.git cd AutoRag pip install -r requirements.txt python main.py

Run pipelines and visualize outputs

Debug and improve results in real-time

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors