Skip to content

DishaAgarwalla/DocQuery

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

📄 DocQuery - Intelligent Document QA System

Live Demo License: MIT

DocQuery is an intelligent document question-answering system that allows users to upload PDF documents and ask questions in natural language. The system uses advanced NLP techniques to provide accurate answers with page citations.

✨ Features

  • 📚 PDF Upload & Processing - Upload multiple PDF documents
  • 🤖 AI-Powered Answers - Uses FLAN-T5 for accurate responses
  • 📍 Page Citations - Answers include exact page numbers
  • 🌙 Dark Mode - Toggle between light and dark themes
  • 📋 Copy Answers - One-click copy to clipboard
  • 📥 Export Q&A - Save conversations as text files
  • 💡 Question Suggestions - AI-generated follow-up questions
  • 📊 Document Comparison - Compare answers across documents

🛠️ Tech Stack

  • Frontend: Streamlit
  • PDF Processing: PyPDF
  • Text Chunking: LangChain
  • Embeddings: Sentence Transformers (all-MiniLM-L6-v2)
  • Vector Store: FAISS
  • LLM: FLAN-T5-small

📦 Installation (Local)

git clone https://github.com/DishaAgarwalla/DocQuery.git
cd DocQuery
pip install -r requirements.txt
streamlit run streamlit_app.py

📝 How to Use

  1. Upload PDF files via sidebar
  2. Click "Process New Documents"
  3. Wait for processing to complete
  4. Ask questions in natural language
  5. Get answers with page citations

🙏 Acknowledgements

  • Streamlit for the amazing framework
  • Hugging Face for FLAN-T5 model
  • FAISS for vector similarity search

👨‍💻 Author

Disha Agarwalla


Made with ❤️ by Disha Agarwalla

About

AI-powered document querying platform that allows users to upload PDFs and ask questions in natural language. Uses LLMs and semantic search to extract meaningful insights and provide contextual answers from documents.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages