rag-dive

Working code to implement RAG for Question Answering on the SQuAD dataset.

This is the code that goes along with our Practical ML Dive into RAG.

Generate context data

oxen download ox/SQuAD dev.csv
python generate_context.py dev.csv dev_contexts.jsonl

Compute embeddings

TODO: take in CLI args

python compute_embeddings.py

Download embeddings from Oxen

oxen download oxbot/SQuAD-Dev-Embed-4 dev_contexts_embeddings.parquet

Setup Chroma

https://docs.trychroma.com/troubleshooting#sqlite

pip install chromadb==0.4.3

vim ~/.venv_rag/lib/python3.11/site-packages/chromadb/__init__.py

Add these few lines...

__import__('pysqlite3')
import sys
sys.modules['sqlite3'] = sys.modules.pop('pysqlite3')

import chromadb
chroma_client = chromadb.Client()

collection = chroma_client.create_collection(name="squad_embeddings")

Insert all the embeddings into chroma.

TODO: Make cli params work

python index_into_chroma.py -i embeddings.parquet -o chroma.db

Compute Recall

Figure out how well the embeddings retrieval system works

TODO: Take in N as CLI param

python compute_recall.py ~/Datasets/Not-In-Context/squad_dev.jsonl chroma-dev.db results.jsonl

Compute Precision

Figure out how well we can extract the answer from the context

python compute_precision.py -m meta-llama/Llama-2-7b-chat-hf -d ~/Datasets/SQuAD-Context/experiments/dev-recall-3.jsonl -o ~/Datasets/SQuAD-Context/experiments/dev-llama-recall-3-precision-3-shot.jsonl -n 3

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
images		images
.gitignore		.gitignore
LICENSE		LICENSE
RAG Experiments.ipynb		RAG Experiments.ipynb
README.md		README.md
close_but_not_right_data.py		close_but_not_right_data.py
compute_embeddings.py		compute_embeddings.py
compute_embeddings_local.py		compute_embeddings_local.py
compute_embeddings_oxen.py		compute_embeddings_oxen.py
compute_precision.py		compute_precision.py
compute_precision_mamba.py		compute_precision_mamba.py
compute_recall.py		compute_recall.py
create_sft_data.py		create_sft_data.py
embeddings.py		embeddings.py
generate_context.py		generate_context.py
index_into_chroma.py		index_into_chroma.py
index_oxen_docs.py		index_oxen_docs.py
merge_lora_model.py		merge_lora_model.py
predict.py		predict.py
query.py		query.py
query_chroma.py		query_chroma.py
query_no_context.py		query_no_context.py
squad_cleanup.py		squad_cleanup.py
stream_embeddings.py		stream_embeddings.py
synthetic_data.py		synthetic_data.py
train.py		train.py
util.py		util.py

License

Oxen-AI/rag-dive

Folders and files

Latest commit

History

Repository files navigation

rag-dive

Generate context data

Compute embeddings

Download embeddings from Oxen

Setup Chroma

Compute Recall

Compute Precision

About

Resources

License

Stars

Watchers

Forks

Languages