A Gradio App for Retrieval-Augmented-Generation on PDFs
RAGify is a tool for querying PDFs on the fly. PDFs are uploaded from the host computer, subject to OCR, vectorized using langchain.text_splitter
functions, and stored in a temporary vector database, which is searched using chroma
's query function. Results are fed to the user's model of choice (currently either tinyllama-1.1b-chat-v1.0.Q3_K_M.gguf and llama-2-7b-chat.Q5_K_M.gguf).
- Download and add desired LLMs to
models/
. \ Note: Current version of the app accepts tinyllama-1.1b-chat-v1.0.Q3_K_M.gguf or llama-2-7b-chat.Q5_K_M.gguf. - Add the prompt templates in the
rag()
function and in the dropdown menu (via themodels
list). - Load dependencies
- Run the app
Feel free to submit a pull request for any issues or improvements!
Author: Megan Chiovaro (@mchiovaro)