DocuChat is a chatbot for exploring and querying your PDF data. It integrates LangChain, LangSmith, the Unstructured loader, ChromaDB, and Streamlit to offer dynamic, context-aware interactions for deeper insights into your PDF documents.
- Extracts and analyzes text, tables, and images from PDF documents.
- Uses sophisticated LLMs to provide context-aware responses.
- Supports dynamic interactions through a Streamlit web interface.
Prerequisites:
- Python 3.7 or higher
- pip
- Clone the repository:

      git clone https://github.com/dcprakash/DocuChat.git
      cd DocuChat

- Create and activate a virtual environment:

      python3 -m venv venv
      source venv/bin/activate

- Install the required dependencies:

      pip install -r requirements.txt

- Create a `.env` file in the project directory and add your OpenAI API key:

      OPENAI_API_KEY=your_openai_api_key
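
As a rough illustration of how a Python application could pick up this key at startup, the sketch below parses a `.env` file using only the standard library. The helper name `load_env_file` is hypothetical and not part of DocuChat; the project itself may well rely on a package such as python-dotenv instead.

```python
import os
from pathlib import Path


def load_env_file(path=".env"):
    """Read KEY=VALUE pairs from a .env file into os.environ.

    Hypothetical helper for illustration; not taken from the DocuChat code.
    Returns a dict of the keys found in the file and their effective values.
    """
    env_path = Path(path)
    if not env_path.is_file():
        return {}

    loaded = {}
    for raw_line in env_path.read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        # Skip blank lines, comments, and lines without an assignment.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        key = key.strip()
        # Drop surrounding whitespace and optional quotes around the value.
        value = value.strip().strip("'\"")
        # Variables already set in the real environment take precedence.
        os.environ.setdefault(key, value)
        loaded[key] = os.environ[key]
    return loaded
```

Calling `load_env_file()` from the project directory and then reading `os.environ.get("OPENAI_API_KEY")` is a quick way to confirm that the key is visible before starting the application.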
To run the application, execute the provided shell script:

    ./setup_and_run.sh

The application will start, and you can access it in your web browser at http://localhost:8501.
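
If the page does not load, a small check like the following can tell you whether anything is answering on that address. This is only a sketch: the function name `app_is_reachable` is hypothetical, and the default URL is simply the one mentioned above.

```python
import urllib.error
import urllib.request


def app_is_reachable(url="http://localhost:8501", timeout=2.0):
    """Return True if an HTTP server answers at the given URL.

    Hypothetical helper for illustration; not part of DocuChat.
    """
    # Ignore any system-wide proxy settings, since the target is local.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    try:
        with opener.open(url, timeout=timeout) as response:
            return 200 <= response.status < 400
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, DNS failure, or an HTTP error status.
        return False


if __name__ == "__main__":
    if app_is_reachable():
        print("The application is responding.")
    else:
        print("Nothing is responding yet; check the script output.")
```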
Contributions are welcome! Please read the CONTRIBUTING.md file for guidelines.
This project is licensed under the MIT License. See the LICENSE file for details.
Created by Darshan Chinvar. Reach out at chinvarpd@gmail.com.