This repository is an exercise in developing Retrieval-Augmented Generation (RAG) using the Mistral 7B LLM and Milvus database. The application is designed to run locally within Docker containers.
The dataset used for this project: Wikipedia Movie Plots - Kaggle
To set up the environment and get started with the project, follow these steps:
-
Clone the repository and navigate to its directory.
-
Initialize and activate the environment:
pipenv --python 3.11 pipenv install pipenv run python -m ipykernel install --user --name="da_$(basename $(pwd))" --display-name="da_$(basename $(pwd))"
-
Restart your code editor to recognize the new environment.
-
Open the
main.ipynbnotebook and select the appropriate kernel and interpreter. -
Run the notebook to start interacting with the RAG setup.
For additional notes and development progress, refer to devjournal.ipynb.
main.ipynb- The primary notebook for running RAG processes.devjournal.ipynb- The development work log and notes.