Skip to content

Chat with your PDF files using LLM model and VectorDatabase

License

Notifications You must be signed in to change notification settings

IMvision12/RAG-LlamaIndex

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RAG-LlamaIndex

RAG-LlamaIndex is a project aimed at leveraging RAG (Retriever, Reader, Generator) architecture along with Llama-2 and sentence transformers to create an efficient search and summarization tool for PDF documents. This tool allows users to query information from PDF files using natural language and obtain relevant answers or summaries.

Sample Input and Output

The data utilized in this scenario were research papers on Llama and Gemma.

Query: Which GPU was used to train LLaMA-65B? Output :

Loading checkpoint shards: 100% 2/2 [01:02<00:00, 31.34s/it]
LLM Output: LLaMA-65B was trained on A100-80GB.

Query: What is count of Embedding Parameters for gemma-2b and gemma-7b? Output :

Loading checkpoint shards: 100% 2/2 [01:02<00:00, 31.34s/it]
According to the context information, the count of embedding parameters for gemma-2b is 524,550,144, and for gemma-7b, it is 786,825,216.

Setup 💻

  1. Clone Github Repo:
$ git clone https://github.com/IMvision12/RAG-LlamaIndex.git
$ cd RAG-LlamaIndex
  1. Install Libraries
$ pip install -r requirements.txt
  1. Get PDF data

The provided links will download pdf files, which will then be stored in a folder named "data". If you have your own PDF files, please relocate them to the "data" folder.

$ python utils.py --links https://arxiv.org/pdf/2302.13971 https://arxiv.org/pdf/2403.08295
  1. Run Main.py
$python main.py --data-directory "/content/RAG-LlamaIndex/data" \
                --llm-model "meta-llama/Llama-2-7b-chat-hf" \
                --embed-model "sentence-transformers/all-mpnet-base-v2" \
                --hf-api "Your HuggingFace Access Token" \
                --query "Enter your Query!"

Releases

No releases published

Packages

No packages published

Languages