rag-api

Introduction

This repository hosts a FastAPI-based API for Retrieval-Augmented Generation (RAG), a cutting-edge technique in Natural Language Processing that enhances response quality by retrieving relevant information during the generation process. Our implementation focuses on speed, scalability, and ease of integration utilizes Milvus for efficient vector storage.

Features

Fast response times suitable for real-time applications.
Scalable architecture, handling concurrent requests efficiently.
Customizable retrieval component for domain-specific applications.

Default Configuration

Embedding Model: intfloat/multilingual-e5-small (Embedding dimensions: 384)
Indexing algorithm: HNSW (maxConnections: 32, efConstruction: 128, ef: 64)
Similarity metric: Cosine distance

Note: The API design is highly flexible, allowing for straightforward substitutions of the embedding model and indexing algorithm. Users can easily replace 'intfloat/multilingual-e5-small' with their preferred model and switch out the HNSW indexing algorithm in Milvus, adapting the system to diverse requirements and use cases.

Supported file types

pdf
txt
docx / doc
html
md
py
js
json
csv
xml
xlsx / xls

Install

bash setup.sh

Run

bash setup.sh

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
api_testing		api_testing
milvus		milvus
README.md		README.md
embeddings.py		embeddings.py
install_pandoc.py		install_pandoc.py
load.py		load.py
pandoc-3.1.6.2-1-amd64.deb		pandoc-3.1.6.2-1-amd64.deb
requirements.txt		requirements.txt
run.sh		run.sh
server.py		server.py
setup.sh		setup.sh
split.py		split.py
vectorstore.py		vectorstore.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

rag-api

Introduction

Features

Default Configuration

Supported file types

Install

Run

About

Releases

Packages

Languages

Mutahar789/rag-api

Folders and files

Latest commit

History

Repository files navigation

rag-api

Introduction

Features

Default Configuration

Supported file types

Install

Run

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages