Atlas Vector Search with RAG

The Python scripts in this repo use Atlas Vector Search with Retrieval-Augmented Generation (RAG) architecture to build a Question Answering application. They use the LangChain framework, OpenAI models, as well as Gradio in conjunction with Atlas Vector Search in a RAG architecture, to create this app.

Setting up the Environment

Install the following packages:

pip3 install langchain pymongo bs4 openai tiktoken gradio requests lxml argparse unstructured

Create OpenAI API Key from here. Note that this requires a paid account with OpenAI, with enough credits. OpenAI API requests stop working if credit balance reaches $0.
Save the OpenAI API key and the MongoDB URI in the key_param.py file, like this:

openai_api_key = "ENTER_OPENAI_API_KEY_HERE"
MONGO_URI = "ENTER_MONGODB_URI_HERE"

Use the following two python scripts:
- load_data.py: This script will be used to load your documents and ingest the text and vector embeddings, in a MongoDB collection.
- extract_information.py: This script will generate the user interface and will allow you to perform question-answering against your data, using Atlas Vector Search and OpenAI.

Note: In this demo, I've used:

DB Name: langchain_demo
Collection Name: collection_of_text_blobs
The text files that I am using as my source data are saved in a directory named sample_files.

Main Components

LangChain	OpenAI	Atlas Vector Search	Gradio
DirectoryLoader: - All documents from a directory - Split and load - Uses the Unstructured package	Embedding Model: - text-embedding-ada-002 - Text → Vector embeddings - 1536 dimensions	Vector Store	UI for LLM app - Open-source Python library - Allows to quickly create user interfaces for ML models
RetrievalQA: - Retriever - Question-answering chain	Language model: - gpt-3.5-turbo - Understands and generates natural language - Generates text, answers, translations, etc.
MongoDBAtlasVectorSearch: - Wrapper around Atlas Vector Search - Easily create and store embeddings in MongoDB collections - Perform KNN Search using Atlas Vector Search

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

extract_information.py

extract_information.py

load_data.py

load_data.py

Repository files navigation

Atlas Vector Search with RAG

Setting up the Environment

Main Components

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
README.md		README.md
extract_information.py		extract_information.py
load_data.py		load_data.py

mongodb-developer/atlas-vector-search-rag

Folders and files

Latest commit

History

Repository files navigation

Atlas Vector Search with RAG

Setting up the Environment

Main Components

About

Resources

Stars

Watchers

Forks

Languages