
Private Retrieval Augmented Generation (PRAG)

PRAG augments Large Language Models (LLMs) with the ability to securely query external data sources, providing context-specific accuracy without compromising data privacy. While LLMs like GPT show significant potential in generalizing across tasks through prompt engineering, their static view of the world and occasional 'hallucinations' caused by missing context can be limiting. PRAG bridges this gap by introducing a private retrieval phase, allowing LLMs to leverage private data at inference time without exposing sensitive information. This repository offers a first-of-its-kind approach covering both the retrieval and inference phases, incorporating secure solutions such as Community Transformers, MPCFormer, and PUMA.

Setup

The two key dependencies are crypten, for MPC computations, and beir, for information-retrieval benchmarking.
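
A minimal environment check, assuming both packages are installed from PyPI:

```python
# Install with: pip install crypten beir
import crypten

crypten.init()  # start the CrypTen MPC runtime before any encrypted computation
```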

Structure

The core MPC functions built on crypten (currently only cosine similarity and dot product) live in mpc_functions.py. They can be imported directly, but require some light wrapping to unpickle their outputs.
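
For intuition, here is a rough sketch of how such similarity scores can be computed with CrypTen's tensor API; the function names are illustrative, not the repository's exact signatures:

```python
import torch
import crypten

crypten.init()

def mpc_dot(a: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
    """Dot product computed on secret-shared (encrypted) vectors."""
    a_enc, b_enc = crypten.cryptensor(a), crypten.cryptensor(b)
    return (a_enc * b_enc).sum().get_plain_text()

def mpc_cos(a: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
    """Cosine similarity under MPC; sqrt and division use CrypTen's approximations."""
    a_enc, b_enc = crypten.cryptensor(a), crypten.cryptensor(b)
    dot = (a_enc * b_enc).sum()
    norm = (a_enc * a_enc).sum().sqrt() * (b_enc * b_enc).sum().sqrt()
    return (dot / norm).get_plain_text()
```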

LocalDenseRetrievalExactSearch.py provides an IR wrapper (adapted from the beir package) that embeds datasets and queries with various transformer models and then performs k-nearest search over the embeddings. This variant of the class allows embeddings to be pre-generated and stored to file, which speeds things up considerably. Note that the default beir search process uses chunked retrieval, which is also a large speedup on real datasets.
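
The caching idea boils down to something like the following sketch (the helper name and cache path are hypothetical; `model` is any beir-style embedding model exposing encode_corpus):

```python
import os
import torch

def load_or_embed_corpus(model, corpus, path="corpus_embeddings.pt"):
    """Embed the corpus once and cache the result to disk (illustrative helper)."""
    if os.path.exists(path):
        return torch.load(path)  # reuse pre-generated embeddings
    embeddings = model.encode_corpus(corpus, batch_size=128, convert_to_tensor=True)
    torch.save(embeddings, path)  # persist for future runs
    return embeddings
```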

MPCDenseRetrievalExactSearch.py extends the DenseRetrievalExactSearch (DRES) class to allow for MPC computations. This class also acts as a standard wrapper for the MPC functions in mpc_functions.py.
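
Conceptually the extension looks something like this sketch against beir's DRES interface (assuming MPC scorers like the mpc_cos / mpc_dot sketched above, here in batched matrix form as DRES expects):

```python
from beir.retrieval.search.dense import DenseRetrievalExactSearch

class MPCDenseRetrievalExactSearch(DenseRetrievalExactSearch):
    """DRES subclass exposing MPC-backed scorers next to the plaintext ones."""

    def __init__(self, model, **kwargs):
        super().__init__(model, **kwargs)
        # DRES dispatches on self.score_functions; register the secure variants
        # (mpc_cos / mpc_dot: batched versions of the MPC scorers above)
        self.score_functions["mpc_cos_sim"] = mpc_cos
        self.score_functions["mpc_dot"] = mpc_dot
```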

Running

A good place to start is beir_MPC_example.py. Its setup function (not executed by default) creates a set of embeddings from a standard dataset (e.g. MS MARCO) and stores them to file; you can also embed just a sample of the dataset for testing. The file then walks through using the MPC and non-MPC functions to perform a k-nearest search over the dataset and benchmark the results.
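
A condensed version of that flow, assuming the MPC search class above and beir's standard dataset tooling (the URL is BEIR's public dataset mirror; the score function name matches the hypothetical registration above):

```python
from beir import util
from beir.datasets.data_loader import GenericDataLoader
from beir.retrieval import models
from beir.retrieval.evaluation import EvaluateRetrieval

# Download and load a standard benchmark dataset
url = "https://public.ukp.informatik.tu-darmstadt.de/thakur/BEIR/datasets/msmarco.zip"
data_path = util.download_and_unzip(url, "datasets")
corpus, queries, qrels = GenericDataLoader(data_path).load(split="dev")

# Wrap an embedding model in the MPC-aware search class sketched above
searcher = MPCDenseRetrievalExactSearch(models.SentenceBERT("msmarco-distilbert-base-v3"))
retriever = EvaluateRetrieval(searcher, score_function="mpc_cos_sim")

results = retriever.retrieve(corpus, queries)
ndcg, _map, recall, precision = retriever.evaluate(qrels, results, retriever.k_values)
```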

If you're interested in accuracy or speed on synthetic data, run speed_vs_size_scaling_evaluation.py. It generates a random dataset and query vector, performs a k-nearest search using both the MPC and non-MPC functions, and prints the results to the console.
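
The synthetic benchmark reduces to something like this (illustrative; the sizes and timing harness are arbitrary):

```python
import time

import crypten
import torch

crypten.init()

n, d, k = 10_000, 768, 10          # corpus size, embedding dim, neighbors
corpus = torch.randn(n, d)
query = torch.randn(1, d)

t0 = time.time()
plain_scores = query @ corpus.T     # plaintext dot-product scores
plain_topk = torch.topk(plain_scores, k)
print(f"plaintext search: {time.time() - t0:.4f}s")

t0 = time.time()
q_enc = crypten.cryptensor(query)
c_enc = crypten.cryptensor(corpus.T)            # transpose in the clear, then encrypt
mpc_scores = q_enc.matmul(c_enc).get_plain_text()
mpc_topk = torch.topk(mpc_scores, k)            # top-k selection done in the clear here
print(f"MPC search:       {time.time() - t0:.4f}s")
```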
