Using Pinecone, LangChain, and OpenAI for Generative Q&A with Retrieval-Augmented Generation (RAG).
- Set up the knowledge base (in Pinecone)
- Chunk the content
- Create vector embeddings from the chunks
- Load embeddings into a Pinecone index
- Ask a question
- Create vector embedding of the question
- Find relevant context in Pinecone, looking for embeddings similar to the question
  - Send the question to OpenAI along with the relevant context retrieved from Pinecone
[TODO - add diagram]
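The workflow above can be sketched with the underlying OpenAI and Pinecone clients (the repo itself goes through LangChain). The index name, model names, chunking parameters, and the question are illustrative assumptions, not taken from this repo:

```python
# Minimal RAG sketch. Assumes OPENAI_API_KEY and PINECONE_API_KEY are set in
# the environment; index/model names below are hypothetical placeholders.

def chunk_text(text: str, chunk_size: int = 400, overlap: int = 50) -> list[str]:
    """Split text into overlapping character chunks for embedding."""
    chunks, start = [], 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        start += chunk_size - overlap
    return chunks

def answer_question(question: str, chunks: list[str]) -> str:
    """Embed chunks, upsert them to Pinecone, retrieve context, ask OpenAI."""
    from openai import OpenAI      # imported here so chunk_text stays stdlib-only
    from pinecone import Pinecone

    client = OpenAI()
    index = Pinecone().Index("rag-demo")  # hypothetical index name

    # 1. Create vector embeddings from the chunks and load them into Pinecone.
    embs = client.embeddings.create(model="text-embedding-3-small", input=chunks)
    index.upsert(vectors=[
        (str(i), e.embedding, {"text": c})
        for i, (e, c) in enumerate(zip(embs.data, chunks))
    ])

    # 2. Embed the question and find similar chunks to use as context.
    q_emb = client.embeddings.create(
        model="text-embedding-3-small", input=[question]).data[0].embedding
    hits = index.query(vector=q_emb, top_k=3, include_metadata=True)
    context = "\n".join(m.metadata["text"] for m in hits.matches)

    # 3. Ask OpenAI, grounding the answer in the retrieved context.
    resp = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[
            {"role": "system",
             "content": f"Answer using only this context:\n{context}"},
            {"role": "user", "content": question},
        ])
    return resp.choices[0].message.content
```

`chunk_text` overlaps consecutive chunks so sentences split at a boundary still appear intact in at least one chunk.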
```shell
pip install -r ./setup/requirements.txt
cp dotenv .env
vi .env
```
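A plausible `.env` after copying the template might look like the following; the variable names here are guesses based on the services used, and the bundled `dotenv` template is authoritative:

```shell
# Hypothetical contents — check the repo's dotenv template for the real names.
OPENAI_API_KEY=sk-...
PINECONE_API_KEY=...
PINECONE_INDEX=rag-demo
```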
[TODO - the non-SPLADE example works; update the SPLADE example]
[TODO - show sample output]
[TODO - clean up requirements.txt or pipenv]
```shell
streamlit run streamlit-app.py
```
[TODO - show example screenshot]
[TODO - Update to support multiple PDFs]
[TODO - add customization of look & feel]
Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., … Kiela, D. (2020). Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. In H. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, & H. Lin (Eds.), Advances in Neural Information Processing Systems (Vol. 33, pp. 9459–9474). Retrieved from https://proceedings.neurips.cc/paper_files/paper/2020/file/6b493230205f780e1bc26945df7481e5-Paper.pdf