This repository contains an application designed to assist visually impaired individuals by providing essential functionalities such as text recognition, object detection, article searching, music detection, and more. The application leverages advanced AI models and APIs to deliver a seamless and accessible user experience.
Empowers users to search for news articles using voice commands or text input. The application fetches articles from reliable sources and reads them aloud, ensuring accessibility for visually impaired users.
Offers a conversational AI assistant capable of answering user queries, providing guidance, and performing tasks. The chatbot integrates seamlessly with other features, enhancing the overall user experience.
Leverages Spotify's API to identify and play music tracks. Users can explore songs, view track details, and enjoy a personalized music experience.
Utilizes advanced image recognition to detect and identify currency notes. This feature ensures users can handle cash transactions with confidence and ease.
Assists users in navigating their surroundings by estimating the distance to nearby objects. This feature enhances spatial awareness and safety for visually impaired individuals.
Identifies products and provides detailed information, including brand and specifications. This feature is particularly useful for shopping and daily tasks.
Extracts text from images or documents using OCR technology. The recognized text is read aloud, enabling users to access printed or handwritten content effortlessly.
- Node.js (v16 or higher)
- Python (v3.8 or higher)
- Docker (optional, for containerized deployment)
-
Navigate to the
frontenddirectory:cd frontend -
Install the required dependencies:
npm install
-
Create a
.envfile in thefrontenddirectory and configure the following environment variables:VITE_BACKEND_URL=<your_backend_url> VITE_AUDD_API_KEY=<your_audd_api_key> SPOTIFY_CLIENT_ID=<your_spotify_client_id> SPOTIFY_CLIENT_SECRET=<your_spotify_client_secret>
-
Start the development server:
npm run dev
-
Navigate to the
ml_servicedirectory:cd ml_service -
Create a virtual environment and activate it:
python -m venv venv source venv/bin/activate # On Windows, use `venv\Scripts\activate`
-
Install the required Python dependencies:
pip install -r requirements.txt
-
Create a
.envfile in theml_servicedirectory and configure the following environment variables:SERPAPI_API_KEY=<your_serpapi_api_key> GOOGLE_API_KEY=<your_google_api_key> GROQ_API_KEY=<your_groq_api_key>
-
Start the backend server:
python main.py
- Ensure that the
.gitignorefile excludes sensitive files like.envto prevent accidental commits. - For Spotify integration, run the command
node spotify_token_cli.jsscript to generate and update the necessary tokens in the.envfile. - To test the application, navigate to
http://localhost:3000for the frontend andhttp://localhost:8000for the backend.
