Document Q&A System

A modern, AI-powered document question and answer system built with Python and OpenAI's API.

🚀 Features

Modern GUI: Sleek CustomTkinter interface with dark theme
Multiple AI Models: Choose from GPT-4o Mini, GPT-4o, GPT-3.5 Turbo, or GPT-4
Smart PDF Processing: Intelligent text chunking with overlap for better context
Vector Search: Advanced similarity search for relevant document sections
Real-time Q&A: Interactive question answering with progress indicators

📁 Project Structure

document_searcher/
├── main.py                 # Original command-line interface
├── modern_gui.py          # Modern GUI application
├── run_gui.py             # GUI launcher script
├── query_manager.py       # Command-line query interface
├── qa_agent.py            # AI model interface
├── document_loader.py     # PDF processing and chunking
├── embeddings_manager.py  # Vector embeddings management
├── vector_search.py      # Similarity search
├── pdf_metadata.py       # PDF metadata extraction
├── model_comparison.py   # Model performance testing
├── gui_app.py            # Original GUI (legacy)
├── api_testing.py        # API testing utilities
├── view_usage.py         # Usage monitoring
└── README.md             # This file

🛠️ Installation

Clone the repository:

git clone https://github.com/JamesKruik/document-qa-system.git
cd document-qa-system/document_searcher

Install dependencies:

pip install openai python-dotenv PyPDF2 numpy customtkinter

Set up environment variables:
- Create a .env file in the document_searcher directory
- Add your OpenAI API key:
```
OPENAI_API_KEY=your_api_key_here
```

🎮 Usage

GUI Application (Recommended)

python modern_gui.py

Command Line Interface

python main.py

Query Manager (Interactive CLI)

python query_manager.py

Model Comparison

python model_comparison.py

🤖 Available AI Models

Model	Cost	Accuracy	Best For
GPT-4o Mini	Low	Good	Simple questions
GPT-4o	Medium	Excellent	Most tasks
GPT-3.5 Turbo	Very Low	Medium	Technical content
GPT-4	High	Outstanding	Complex reasoning

📖 How It Works

Load PDFs: Select and load your PDF documents
Process Documents: Create vector embeddings for intelligent search
Ask Questions: Type questions and get accurate answers
Choose Models: Select the AI model that fits your needs

🔧 Technical Details

PDF Processing: PyPDF2 for text extraction
Text Chunking: Smart chunking with sentence boundary detection
Embeddings: OpenAI's text-embedding-3-small model
Vector Search: Cosine similarity with multiple chunk retrieval
GUI Framework: CustomTkinter for modern interface

📝 Example Questions

"How do I configure the device settings?"
"What are the technical specifications?"
"What troubleshooting steps are available?"

🤝 Contributing

Fork the repository
Create a feature branch
Make your changes
Submit a pull request

📄 License

This project is open source and available under the MIT License.

🙏 Acknowledgments

OpenAI for the powerful API
CustomTkinter for the modern GUI framework
PyPDF2 for PDF processing capabilities

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Document Q&A System

🚀 Features

📁 Project Structure

🛠️ Installation

🎮 Usage

GUI Application (Recommended)

Command Line Interface

Query Manager (Interactive CLI)

Model Comparison

🤖 Available AI Models

📖 How It Works

🔧 Technical Details

📝 Example Questions

🤝 Contributing

📄 License

🙏 Acknowledgments

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
README.md		README.md
api_testing.py		api_testing.py
document_loader.py		document_loader.py
embeddings_manager.py		embeddings_manager.py
gui_app.py		gui_app.py
main.py		main.py
model_comparison.py		model_comparison.py
modern_gui.py		modern_gui.py
pdf_metadata.py		pdf_metadata.py
qa_agent.py		qa_agent.py
query_manager.py		query_manager.py
requirements.txt		requirements.txt
run_gui.py		run_gui.py
test_markdown_demo.py		test_markdown_demo.py
vector_search.py		vector_search.py
view_usage.py		view_usage.py

JamesKruik/document-qa-system

Folders and files

Latest commit

History

Repository files navigation

Document Q&A System

🚀 Features

📁 Project Structure

🛠️ Installation

🎮 Usage

GUI Application (Recommended)

Command Line Interface

Query Manager (Interactive CLI)

Model Comparison

🤖 Available AI Models

📖 How It Works

🔧 Technical Details

📝 Example Questions

🤝 Contributing

📄 License

🙏 Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages