Visionary Assistant is an innovative tool that captures your screen, analyzes the content using OpenAI's advanced GPT-4 model, and provides auditory feedback on what it sees. This project is perfect for users who want to leverage AI for real-time screen analysis and auditory response.
- Screen capture and processing.
- Image analysis using OpenAI's GPT-4 model.
- Auditory feedback generated from the analysis.
- Python 3.6 or higher
- Pip (Python package installer)
-
Clone the Repository
git clone https://github.com/your-username/visionary-assistant.git cd visionary-assistant
-
Install Dependencies
pip install -r requirements.txt
-
Set Up Environment Variables
- Create a new file named
.env
in the root directory of the project. - Add your OpenAI API key to the .env file:
- Create a new file named
Run the following command to start the application:
python main.py